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Sir: 



ebruary 26, 2003 



I, Andreas Rump of the metaGen Pharmaceuticals GmbH, Gbrmany, do hereby 
declare the follo\ vin| ; : I 



I have attached a eopy of my curriculum vitae to Ms Declar4tion. 

* 
i 

I am Research Scientist, of the Oncology Department and hajve conducted the 
c^eximents described below. i 

I am an co-inventor of the above referenced patent applicatioin and am familiar 
with the development, usages and properties of LOBO ("img-bone"!) proteins described 
therein. 



I have read and understand the subject matter of the Office Ajction of September 



3, 2002. 
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The folio wing comments are offered in support off jftie patentability of the instant 

i 

invention. 



I have reformed two types of searches to demonstfate the sijnilarity between the 
murine LOBO protein and the human homolog LOBO protein described in the instant 
invention. In the irst search, I used LALIGN to coinpa^Wh the pj-otein and the DNA 
sequences of the rcuiine and human LOBO sequences. Thiis search was done using the 
complete sequent o f theimurine LOBO protein/DNA disposed in tjie apphcation and 
the partial sequence of the human as known at the time of filing as well as the complete 
sequence as determined after the filing date of the instantiatppHcatioi LALIGN compares 
two protein or DNA sequences for local similarity and shows the locjal sequence 
ahgnments, LALIGN uses code developed by X Huang iajid W. Miller (Adv. Appl. 
Math. (1991) 12: 337 -357) for the "sim* program. LALMN will report a specified 
number of ahgnments (the default is 10) between the two Sequences and their scores. 
The results of these comparisons are attached hereto as Appendices I (comparison of the 
complete mouse sequences v. partial human sequences Mown at the time of filing) and II 
(comparison of the complete mouse sequences v. complete human sequences as 
determined after the application was filed) . i 



The resuks ir. Appendix I indicate that the sequeaib identity ^>etween the LOBO 
Coding Sequences o:: the Mouse (complete) v. Man (pardaji) was S5h%. The sequence 
identity betweenrthe Mouse (complete) v. Man (partial) inlthe LOB(|) Protein- Alignment 
was 88. 1% and tiie protein sequence similarity was 94.8%i The resets in Appendix II 
indicate that the sequence identity for the LOBO Coding Sequences of the Mouse 
(complete) and Man (complete) was 85.9%. The sequence identity fcetween the Mouse 
(complete) and Man (complete) in the LOBO Protem-ALignment wajs 88.3% and the 
protein sequence similarity was 96.2%. The results in Appendix II jvhich compares the 
Mouse (complete:) tc Man (complete) further demonstrate^ the high jdegree of 
conservation between the murine and human DNA and ptotein sequences. 
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I also performed a BLAST search (See, Altschul, & et al. (1^97) "Gapped 
BLAST and PSI-BLAST: a new generation of protein database search programs", 
Nucleic Acids Res. 25:3389-3402) to locate similar sequences usingjas query the 
complete mouse LOBO pfotein sequence shown as SEQ I© No: 9 ir^the present 
application (see Anrex 1 ; 870 amino acid residues). The blast search 1 was done via the 
NCBI website rhrf p //www.ncbi "'™ n«h gov/BLAST/) egSinst the complete translated 
GenBank databa^on January 29, 2003 (blastP). The complete results are summarized in 
Table 2 , where the Subject ID, accession number, brief description cjf the subject, a bit 
score and e-value are listed. This list is followed by an alignment of| the query and 
subject sequences. Each alignment indicates the percent identity an| percent similarity 
between the two aligned sequences. Table 1 lists the ten sequences jiaving the most 

i 

significant identities. j 

The blast? search; Protein query-Translated db [WastP] was lione using the 
program BLASTP 2.2.5 (version from Nov- 16-2002) willirthe folio jing default 
parameters: 

t 
i 

! 

I 

a. Databases searched: non-redundant = nr (All non^tedundant penBank CDS 
ti-anslations + PDB =[Sequences derived from the ; 3 -dimensional structure from 
Brookha^en Protein Date Bank] + SwissProt = flist major r elease of the SWISS- 
PROT protein sequence database (no updates)] + filR + PRF) 

b. Limit by emteez query: No Imitations were made.which meabs all subsets of the 
above-cited databases were searched without any;e*clusions[ 

c. Filter used: Low complexity. Filtering can eliminate statistically significant but 
biologically ^interesting reports from the blast output (e.g.j hits against common 
acidic-, bask- or proline-rich regions), leaving the more biojogically interesting 
regions of tha query sequence available for specif*: matching against database 
sequences.. Furthermore, in some cases, sequencsskare masked in their entirety, 
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indicating; tbfct the; statistical significance of any notches reported against the 
unfiltered query sequence should be suspect. j 

d. Expect vflue = 10. This is the statistical significiisute threshojd for reporting 
matches 2-gamst database sequences. When the ddftult value jis 10, only 10 
matches are expected to be found merely by chance, according to the stochastic 
model of Karlin and Altschul (1990). If the statistical significance ascribed to a 
match is sweater than the EXPECT threshold, the iriatch will Lot be reported. 
Lower EXPECT thresholds are more stringent, leading to fe^ver chance matches 
being reported. Increasing the threshold shows le&sstringentjmatches. Fractional 
values arc acceptable. | 

e. Word size - 3 . This sets the size of the short regioris of similarity between 
sequences for which BLAST initially searches. 

f. Matrix = BLOSUM62; Gap Costs; Existence = 1 3;j Extensioii = 1 . Here, the 
theory of amino acid substitution matrices is described in [1] , and applied to DNA 
sequence comparison in [2]. In general, different substitution matrices are tailored 
to detecting similarities among sequences that a*& diverged By differing degrees 

[1-3]. A single matrix may nevertheless be reasonably efficient over a relatively 

j 

broad range of evolutionary change [1-3]. Experimentation has shown that the 

j 
i 

BLOSUM-62 matrix [4] is among the best for detecting mo^t weak protein 

i 

similarities. For particularly long and weak aligamfents, the £LOSUM-45 matrix 
may prove superior. A detailed statistical theory fo!r gapped alignments has not 
been developed, and the best gap costs to use withia given substitution matrix are 
determined empirically. j 

[1] Fitch, W.M- (1983) "Random sequences. u J. Mol. bIoL 163:171-176 

[2] Lipman, DX, Wilbur, W J, Smith T.F. & Waterrnarj, M.S. (1984) "On the 

statisticd significance of nucleic acid sirnilarities." Nucj. Acids Res. 12:215- 

226. : ! j 

[3] Alts<;hul,;S.F. & Erickson, B.W. (1985) ^gnificanJe of nucleotide 

sequence alignments: a method for random skfjuence permutation that 

prestSFves dinucleotide and codon usage." Mol Biol. Evol. 2:526-538. 
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[4] Dekea, J. : (1983) "Probabilistic behavior ofelongest-cprnmon-subsequence 
lengfch;" ta "Time Warps, String Edits and Maaromolecujles: The Theory and 
Practice of Sequence Comparison/ D. Sankoff & J.B. Kjruskal (eds.)> PP- 55 " 
91, Addison-Wesley, Reading, MA. j 
NCBI Blast searches have for years been a routine land standard tool used by 
scientists for identifying similarities between sequences. :phe following citation from 
NCBI (j^ri^g^^ ^l^ih.gov mT • A CT/hfast overdew.htmfl attests to this fact: 
"BLAST® (Basic Local Alignment Search Tool) is a set of simUaritylsearch programs 
designed to explore all of the available sequence databases regardless of whether, the 
query is protein or LWA . ; The BLAST programs have beekdesigned^for speed, with a 
minimal sacrifice of sensitivity to distant sequence relationships. Th\ scores assigned in a 
BLAST search have a weft-defined statistical interpretation, making real matches easier 
to distinguish from random background hits. BLAST us d\ a heuristih algorithm which 
seeks local as opposed to global alignments and is therefore able to\ detect relationships 

among sequences which share only isolated regions of similarity. " j 

■ i 

Using tbes above described standard blast procedure and SEQ ID No.: 9 of the 
present invention as a query, I obtained the following resjits: The hit showing the highest 
degree of identity m the blastP search performed was database identification No gi 
25019857 (accession XP!_1 10318.2). This protein is idenifesd as bejing similar to 
hypothetical pro*in MGC37640 fiom J*, meat* ancj had a 97^ sequence identity 
and a 97% sequence similarity. The murine LOBO proteijq of the present invention and 
XP_1 10318.2 lack identity for only 16 amino add residues. This is likely the result of, 
for example, irresolvable secondary structure encountereilldurmg ths sequencing process. 
Based on this alignment'I believe that XP_11031 8.2 and SEQ ID Vp. 9 describe one and 
the same sequence having equivalent functions. The nineiother hitsjhaving a degree of 
identity greater than 70% are listed in Table 1. All of fc*e proteins; were of human or 
mouse origin. White the identity values for the last tbxeej Sequences) listed in Table 1 
(Accession Nos. BA.C03400.1, AAH301 13.1 and XP_23p49.1) hajre sequence identities 
ranging from 88% to 91 %, these sequences represent a csDjuparison bf only 128-188 
amino acids of the query: sequences. 
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The results of the blastP search are ordered by thfe value ofj % identity (see 
Table 2). The nex* "best? hit, after the 10 noted above, the blastp] search showed 37% 
identity to the protein sequence of SEQ ID NO. : 9. This iaext best hjt in the blastP search 
has the identification No.igi 21292324 (equal to accession (number SAA04469.1), which 
is defined as agCP32T2 [Anopheles gambiae str. PEST]. The ahgnrW of the 
polypeptide translated from gi 21292324 with SEQ ID NCI: 9 result^ in the value of 
37% identity and accounted for 794 amino acid residues. In conclusion there are no 
proteins appearicg-ir. the blastP search having more than 7D% identity to the amino acid 

sequence of SEQ ID NO.S (murine LOBO protein). j- j 

!. I 

All of che closest hits obtained in the blastP search, as identified in Table 1, 
were entered mo the BLAST database after the fribjg date of the above -identified 
application. F^bsrmore, no function has been attributed toj these sequences as 
illustrated by ttefr descriptions, i.e. "hypothetical proton", etc, I believe that these 
proteins will have the same basic function as the raJirine LOBO protein given the 
significant degree-of sequence identity and sequence sinnjferity. j 



The undersigned hereby declares that all statements made herein based upon 
knowledge are true, .and that all statements made based up?m information and belief are 
believed to be tee; £nd further, that these statements werefmade Wiethe knowledge that 
willful false statements and the like so made are punishtilijle by fine cbr imprisonment, or 
both, under Section 1 00 li of Title 18 of the United States .jjode, and jhat such willful false 
statements may jsop-ardia; the validity of the application soir any patejnt issued thereon. 



DATED 




Dr. Andreas Rump 




Enclosures: As stated abpve 



Dr. Andreas Rump 
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TBJlOfWB*** 

Personal Details 

• Date of birth: Sept 7, 1960 

• Place of birth: Hamm, Germany 

• Nationality: German 

• Family status: married, 5 children 

• Private Address: 
Andreas Rump, Leutraer Weg 3 a, 07745 Jena 
@ +49-3641-212866 
H andreas rump@hotmail.com 

Current place of work 

• Since December 2000 

Employed at metaGen Pharmaceuticals, a company dedicated to the 
identification of potential targets suitable for the treatment of solid 
tumors. Position: Scientific Leader of the "genomic technologies group"; 
involved in large-scale annotation of genomic DNA, cloning and 
expression of full-length cDNAs, DNA sequence analysis and early 
validation of target genes. 

Previous places of work 

• January 1996 - November 2000 (5 years) 

Research Scientist at the Institute of Molecular Biotechnology (1MB), 
Dept. of Genome Analysis, headed by Prof. Dr. Andre Rosenthal. 
Areas of work: large-scale sequencing and molecular characterization of 
mouse mutants with defects in bone development. 

• July 1994 -December 1995 (1.5 years) 

PostDoc at the Institute of Neuropathology in Heidelberg, headed by Prof. 
Dr. Marika Kiessling. 

Area of work: expression of glutamate receptors after global cerebral 
ischemia. 




• July 1991 - June 1994 (3 years) 

PostDoc at the German Cancer Research Center (DKFZ) in Heidelberg. 
Department: „Interactions of carcinogens with biological macromole- 
cules", headed by Prof. Dr. Dr. Heinz- Walter Thiebnann. / 
Area of work: molecular characterization of the basal cell nevus 
syndrome. 

Postgraduate education 

• July 1987 - June 1991 (4 years) 

Ph.D. student within the Dept. of Molecular Genetics, at the German 
National Research Center for Environment and Health (GSF). 
Ph.D. thesis: "Molecular characterization of the pythopathogenic fungus 
Phytophthora parasitica: construction of a representative genomic cosmid 
library and identification of pathologically relevant genes". 

Undergraduate education 

• October 1980 - June 1987 (7 years) 

Student at the Institute of Biology at the University of Bielefeld, 
Germany. Diploma thesis from the Dept. of Molecular Biology and 
Genetics, headed by Prof. Dr. Alfred Puhler. Title of thesis: "Nucleotide 
sequence of the nifA gene of Klebsialla pneumoniae M5al". 
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Pseudoautosomal deletions encompassing a novel homeobox gene cause 
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Winkelmann M, Nordsiek G, Heinrich U, Breuning MH, Ranke MB, Rosenthal A, Ogata T, 
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Rump A, Karlovsky P. 
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Nucleotide sequence of a 24,206-base-pair DNA fragment carrying the 
entire nitrogen fixation gene cluster of Klebsiella pneumoniae 
Arnold W, Rump A, Klipp W, Priefer UB, Piihler A. 



APPENDIX f 



LOBO coding sequence (CDS)-alignntent: Mouse (complete) vs. Man (partial, as 
known at the time of patent filing) 

Region of overlap 1521 nt; Identiy [:] 85.2 % 
Program used: LALIGN (rigorous local alignment) 

1120 1130 1140 1150 1160 1170 

MM -LOB AAAGACTGTATCTTCACCATTGATCCATCAACTGCTCGCGACCTTGATGATGCCCTCGCC 



HS - Lob AAAGACTGTATCTTCACCATTGACCCATCAACCGCCCGAGACCTCGATGATGCCCTCTCC 

10 20 30 40 50 60 

' 1180 1190 1200 1210 1220 1230 

MM - LOB TGCAGGCGGCTCACTGATGGCACCTTCGAAGTGGGCGTCCACATCGCCGATGTGAGTTAC 



HS - Lob TGCAAGCCACTCGCTGACGGCAACTTCAAAGTGGGAGTTCACATTGCTGACGTGAGTTAC 

70 80 90 100 110 120 

1240 1250 1260 1270 1280 1290 

MM- LOB TTTGTTCCTGAGGGATCCTCTTTGGATAAAGTAGCTGCTGAGAGAGCCACAAGTGTCTAC 



HS - Lob TTTGTTCCGGAGGGATCTGATCTGGATAAAGTGGCTGCCGAGAGGGCTACAAGCGTCTAC 

130 140 150 160 170 180 

1300 1310 1320 1330 1340 1350' 

MM- LOB TTGGTCCAGAAGGTGGTCCCCATGCTTCCCAGGCTTCTGTGTGAGGAACTCTGCAGCCTC 



HS - Lob TTGGTTCAAAAGGTGGTCCCCATGCTTCCCAGGCTGCTGTGTGAGGAGCTGTGCAGCCTC 

190 200 210 220 230 240 

1360 1370 1380 1390 1400 1410 

MM- LOB AACCCCATGACTGACAAGCTGACCTTCTCTGTGATCTGGAAGCTGACCCCTGAAGGCAAG 



HS - Lob AACCCCATGTCCGACAAGCTGACCTTGTCTGTGATCTGGACACTGACTCCAGAGGGCAAG 

250 260 270 280 290 300 

1420 1430 1440 1450 1460 1470 

MM - LOB ATCCTTGAAGAGTGGTTTGGCCGCACTATCATCCGTTCTTGCACCAAACTGAGCTACGAC 



HS - Lob ATCCTTGATGAATGGTTTGGCCGGACCATCATCCGCTCCTGCACCAAACTTAGCTACGAG 

310 320 330 340 350 360 

1480 1490 1500 1510 1520 1530 

MM-LOB CATGCCCAGAGCATGATCGAAAATCCAACTGAGAAGATCCCTGAGGAAGAGCTTCCCCCA 



HS-Lob CATGCACAGAGCATGATTGAAAGCCCAACTGAGAAAATCCCTGCGAAAGAGCTGCCCCCC 

370 380 390 400 410 420 

1540 1550 1560 1570 1580 1590 

MM- LOB ATTTCTCCAGAGCACAGCGTCGAGGAGGTGCACCAGGCAGTCCTGAACCTGCACAGCATT 



HS - Lob ATTTCCCCAGAGCATAGCAGCGAGGAGGTACACCAGGCCGTCTTGAATCTCCACGGAATT 

430 440 450 460 470 480 

1600 1610 1620 1630 1640 1650 

MM - LOB GCAAAGCAACTCCGCCGCCAGCGCTTTGTAGATGGCGCACTCCGTTTAGATCAGCTGAAG 



HS - Lob GCCAAGCAGTTACGCCAGCAGCGCTTTGTGGACGGCGCACTTCGTTTGGATCAGCTAAAG 

490 500 510 520 530 540 



1660 1670 1680 1690 1700 1710 

MM - LOB CTTGCTTTTACTCTGGACCATGAGACTGGACTGCCTCAAGGATGTCACATCTATGAGTAC 



HS - Lob CTTGCTTTCACTCTGGACCACGAGACCGGATTGCCTCAAGGATGTCATATCTATGAGTAC 

550 560 570 580 590 600 

1720 1730 1740 1750 1760 1770 

MM- LOB CGAGACAGCAACAAGCTTGTAGAGGAGTTCATGCTCCTGGCCAACATGGCGGTGGCCCAC 



HS - Lob CGCGAGAGCAACAAGCTCGTGGAGGAGTTCATGCTCTTGGCCAACATGGCAGTGGCCCAC 

610 620 630 640 650 660 

1780 1790 1800 1810 1820 1830 

MM- LOB AAGATCTTCCGCACCTTCCCTGAGCAGGCCCTGCTGCGCCGGCATCCCCCACCACAGACG 



HS - Lob AAGATCCACCGCGCCTTCCCCGAGCAGGCCCTGCTGCGCCGGCACCCCCCGCCCCAAACA 

670 680 690 700 710 720 

1840 1850 1860 1870 1880 1890 

MM- LOB AAGATGCTCAGTGACCTGGTGGAGTTCTGTGACCAGATGGGGCTGCCCATGGATGTCAGC 



HS - Lob AGGATGCTCAGTGACCTGGTGGAATTCTGCGACCAGATGGGGCTGCCCGTGGACTTCAGC 

730 740 750 760 770 780 

1900 1910 1920 1930 1940 -1950 

MM- LOB TCTGCAGGGGCCCTAAATAAAAGCCTGACTAAGACATTTGGAGATGACAAGTACTCTCTG 



HS - Lob TCCGCAGGAGCCCTCAATAAAAGCCTGACCCAAACATTTGGAGATGACAAGTACTCACTG 

790 800 810 820 830 840 , 

1960 1970 1980 1990 2000 2010 

MM- LOB GCCCGGAAGGAGGTGCTCACCAACATGTACTCCCGGCCCATGCAGATGGCACTGTACTTC 



HS - Lob GCCCGCAAGGAGGTGCTCACCAACATGTGCTCCCGGCCCATGCAGATGGCACTGTACTTC 

850 860 870 880 890 900 

2020 2030 2040 2050 2060 2070 

MM- LOB TGCTCTGGGATGCTGCAGGACCAGGAGCAGTTCCGGCATTATGCTCTCAACGTTCCCCTC 



HS - Lob TGCTCGGGGCTGCTGCAGGACCCAGCGCAGTTCCGGCACTACGCGCTCAATGTGCCCCTG 

910 920 930 940 950 960 

2080 2090 . 2100 2110 2120 2130 

MM- LOB TACACACACTTCACCTCTCCCATCCGCCGCTTTGCTGACGTCATAGTGCACCGCCTCCTG 



HS - Lob TACACACACTTCACCTCGCCCATCCGCCGCTTTGCCGACGTCCTGGTGCACCGCCTCCTG 

970 980 990 1000 1010 1020 

2140 2150 2160 2170 2180 2190 

MM- LOB GCTGCTGCTCTGGGCTACAGTGAACAGCCAGATGTGGAGCCTGATACCCTACAGAAGCAA 



HS - Lob GCTGCCGCGTTAGGCTATAGGGAGCGACTAGACATGGCGCCCGATACCCTGCAGAAACAG 

1030 1040 1050 1060 1070 1080 

2200 2210 2220 2230 2240 2250 - 

MM -LOB GCTGACCACTGCAATGACCGTCGCATGGCTTCCAAACGTGTGCAGGAGCTCAGCATCGGC 



HS-Lob GCGGACCACTGTAACGACCGCCGCATGGCGTCCAAGCGCGTGCAGGAGCTCAGTACCAGT 



ft 



1 



I 



1090 1100 1110 1120 1130 1140 

2260 2270' 2280 2290 2300 2310 

MM- LOB CTCTTCTTCGCAGTTCTAGTAAAGGAGAGTGGCCCCCTGGAGTCCGAAGCCATGGTGATG 



****** 



HS - Lob CTCTTCTTTGCTGTTCTGGTCAAGGAGAGTGGCCCCCTGGAGTCAGAAGCCATGGTGATG 

1150 1160 1170 1180 1190 1200 

2320 2330 2340 2350 2360 2370 

MM- LOB GGTGTCCTGAACCAAGCTTTCGACGTGCTGGTGCTGCGCTTTGGGGTGCAGAAGCGCATC 

....... ••••• ......»•...••-•••••••. •• 

HS - Lob GGCATCCTGAAGCAAGCCTTCGACGTGCTGGTGCTGGGCTACGGCGTGCAGAAGCGCATC 

1210 1220 1230 1240 1250 1260 

2380 2390 2400 2410 2420 2430 

MM- LOB TACTGCAATGCACTGGCCCTGCGATCCTACAGCTTCCAGAAGGTGGGGAAGAAGCCAGAG 

........ .............. ... •■*•**■* 

........ .............. ... •• ........ .. 

HS - Lob TACTGCAACGCACTGGCCCTGCGGTCCCACCACTTCCAGAAGGTGGGCAAGAAGCCGGAA 

1270 1280 1290 1300 1310 1320 

2440 2450 2460 2470 2480 2490 

MM- LOB CTCACTCTTGTTTGGGAGCCTGATGACCTTGAAGAGGAGCCAACACAGCAGGTCATCACC 



HS - Lob CTCACGCTGGTCTGGGAGCCTGAGGACATGGAGCAGGAGCCAGCACAGCAGGTCATCACC 

1330 1340 1350 1360 1370 1380 

2500 2510 2520 2530 2540 2550 

MM- LOB ATCTTCAGCCTGGTGGATGTGGTCCTGCAGGCAGAGGCCACAGCCCTCAAGTACAGTGCT 

ririiii 

HS - Lob ATCTTCAGCCTGGTGGAGGTGGTCCTGCAGGCAGAGTCCACAGCCCTCAAGTACAGCGCC 

1390 1400 1410 1420 1430 1440 

2560 2570 ' 2580 2590 
MM-LOB AT C CTGAAGCGAC CAGG C -CTGG AGAAGGCGTCT 



HS - Lob ATCCTGAAGCGGCCAGGCACCCAGGGCCACCTGGGCCCTGAGAAGGAGGAGGAGGAGTCT 

1450 1460 1470 1480 1490 1500 

2600 2610 
MM-LOB GATGAGGAGC C TGAGGAC TGA 

• ******* * 

*• * ***** ******* • 

HS-Lob GACGGTGAGCC CGAGGACT CA 

1510 1520 



i 



LOBO Protein-Alignment: Mouse (complete) vs. Man (partial, as known at the 
time of patent filing) 

Region of overlap 506 aa; Identiy [:] 88.1 %; Similarity [.] 94.8 % 
Program used: LALIGN (rigorous local alignment) 

380 390 400 410 420 430 

MM- LOB KDC I FT I D PSTARDLDDALACRRLTDGTFEVGVHI ADVS YFVPEGS S LDKVAAERATS VY 



HS-LOB KDCIFTIDPSTARDLDDALSCKPLADGNFKVGVHIADVSYFVPEGSDLDKVAAERATSVY 

10 20 30 40 50 60 

440 450 * 460 470 480 490 

MM- LOB LVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTI IRSCTKLSYD 



HS-LOB LVQKWPMLPRLLCEELCSLNPMSDKLTFSVIWTLTPEGKILDEWFGRTIIRSCTKLSYE 

70 80 90 100 110 120 

500 510 520 530 540 550 

MM- LOB HAQSMIENPTEKIPEEELPPISPEHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLK 



HS-LOB HAQSMIESPTEKIPAKELPPISPEHSSEEVHQAVLNLHGIAKQLRQQRFVDGALRLDQLK 

130 140 150 160 170 180 

560 570 580 590 600 610 

MM- LOB LAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQT 



HS-LOB LAFTLDHETGLPQGCHIYEYRESNKLVEEFMLLANMAVAHKIHRAFPEQALLRRHPPPQT 

190 200 210 220 230 240 

620 630 640 650 660 670 

MM- LOB KMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYF 



HS - LOB RMLSDLVEFCDQMGLPVDFSSAGALNKSLTQTFGDDKYSLARKEVLTNMCSRPMQMALYF 

250 260 270 280 - 290 300 

680 690 700 710 720 730 

MM- LOB CSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQ 



HS - LOB CSGLLQDPAQFRHYALNVPLYTHFTSPIRRFADVLVHRLLAAALGYRERLDMAPDTLQKQ 

310 320 330 340 350 360 

740 750 760 770 780 790 

MM -LOB ADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGVLNQAFDVLVLRFGVQKRI 



HS - LOB ADHCNDRRMASKRVQELSTSLFFAVLVKESGPLESEAMVMGILKQAFDVLVLRYGVQKRI 

370 380 390 400 410 420 

800 810 820 830 840 850 

MM- LOB YCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVITIFSLVDVVLQAEATALKYSA 



HS-LOB YCNALALRSHHFQKVGKKPELTLVWEPEDMEQEPAQQVITIFSLVEWLQAESTALKYSA 

430 440 450 460 470 480 

860 870 
MM-LOB ILKRPGL EKAS DEEPED 



HS-LOB ILKRPGTQGHLGPEKEEEESDGEPED 

490 500 



APPENDIX II 



LOBO Protein-Alignment: Mouse (complete) vs. Man (complete, as determined 
after patent filing) 

Region of overlap 881 aa; Identiy [:] 88 J %; Similarity [.] 96.2 % 
Program used: LALIGN (rigorous local alignment) 

10 20 30 40 50 60 

MM -LOB MNHPDYKLNLRS PGTPRGVSSVVGPSAVGAS PGDKKSKNKSMRGKKKS I FETYMSKEDVS 



HS-LOB MSHPDYRMNLRPLGTPRGVSAVAGPHDIGASPGDKKSKNRSTRGKKKS I FETYMSKEDVS 

10 20 30 40 50 60 

70 80 90 100 110 120 

MM - LOB EGLKRGTL I QGVLR INPKKFHEAF IPS PDGDRD I F I DG WARNRALNGDL WVKLL PEDQ 



HS - LOB EGLKRGTL I QGVLR INPKKFHEAF I PS PDGDRD I F I DG WARNRALNGDL WVKLL PEEH 

70 80 90 100 110 120 

130 140 150 160 170 

MM -LOB WKAVKPESNDKE I EAT YEAD I PEEGCGHHPLQQSRKGWS - GPDVI I EAQFDDSDS EDRHG 



HS-LOB WKWKPESNDKETEAAYESDIPEELCGHHLPQQSLKSYNDSPDVIVEAQFDGSDSEDGHG 

130 140 150 160 170 180 

180 . 190 200 210 220 230 

MM- LOB NTSG - LVDGVKKLS I STPDRGKEDS STPVMKDENTP I PQDTRGLSEKS LQKSAKWY I LE 



HS-LOB I TQNVLVDGVKKLS VCVS EKGREDGDAP VTKDETTC I S QDTRALS EKS LQRS AKWY I LE 

190 200 210 220 230 240 

240 250 260 270 280 290 

MM- LOB KKHSRAATGILKXjLADKNSDLFKXYALFSPSDHRVPRIYVP 



HS-LOB KKHSRAATGFLKiLADKlSISELFRKYALFSPSDHRVPRIYVPLKDCPQDFVARPKDYANTL 

250 260 ' 270 280 290 300 

300 310 320 330 340 350 

MM- LOB FICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLP 



HS-LOB FICRIVDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQGLP 

310 320 330 340 350 360 

360 370 380 390 400 410 

MM- LOB WTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVS YFVPEG 



HS - LOB WTIPPEEFSKRRDLRKDCIFTIDPSTARDLDDALSCKPLADGNFKVGVHIADVS YFVPEG 

370 380 390 400 410 420 

420 430 440 450 460 470 

MM- LOB SSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEW 



HS-LOB SDLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMSDKLTFSVIWTLTPEGKILDEW 

430 440 450 460 470 480 

480 490 500 510 520 530 

MM- LOB FGRTI I RS CTKL S YDHAQSMI ENPTEKI PEEEL P P I S P EHS VEE VHQAVLNLHS I AKQLR 



HS-LOB FGRTI I RS CTKLS YEHAQSMI ES PTEKI PAKELPP I S PEHS S EEVHQAVLNLHGI AKQLR 

490 500 510 520 530 540 



< 



540 * 550 560 570 \ 580 590 

MM -LOB RQRFVDGALRLDQLKLAFTLDHET 



* ■ • * * 



HS-LOB QQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRESNKLVEEFMLLANMAVAHKIHRA 

550 560 570 580 590 600 

600 610 620 630 640 650 

MM- LOB FPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEV 



• * • * ■ 



HS - LOB FPEQALLRRHPPPQTRMLSDLVEFCDQMGLPVDFSSAGALNKSLTQTFGDDKYSLARKEV 

610 . 620 630 640 650 660 

660 670 680 690 700 710 

MM-LOB LTNMYSRPMQMALYFCSGMLQDQEQFRHYALJWPLYTHFTSPIRRFADVIVHRLLAAALG 

: : : : :::::::::::::.::: 
HS-LOB LTNMCSRPMQMALYFCSGLLQDPAQFRHYALNVPLYTHFTSPIRRFADVLVHRLLAAALG 

670 680 690" 700 710 720 

720 730 740 750 760 770 

MM-LOB YSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGVLNQ 



HS-LOB YRERLDMAPDTLQKQADHCNDRRMASKRVQELSTSLFFAVLVKESGPLESEAMVMGILKQ 

730 740 750 760 ■ 770 780 

780 790 800 810 820 830 

MM-LOB AFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVITIFSLV 

I:::::::.:::::::::::::::. 
HS-LOB AFD VL VLR YGVQKR I YCNALALRSHHFQKVGKKPELTL VWE P EDMEQE PAQQVI T I F S LV 

790 800 810 820 830 840 

840 850 860 870 * 

MM - LOB DWLQAEATALKYSAI LKRPGL EKASDEEPED 



HS-LOB EWLQAESTALKYSAILKRPGTQGHLGPEKEEEESDGEPED 

850 860 870 880 



LOBO coding sequence (CDS)-alignment: Mouse (complete) vs. Man (complet 
as determined after patent filing) 

Region of overlap 2646 nt; Identiy [:] 85.9 % 
Program used; LALIGN (rigorous local alignment) 

10 20 30 40 50 60 

MM- LOB ATGAACCATCCTGACTACAAGCTGAACCTTCGGTCTCCGGGGACCCCCAGAGGTGTGTCC 



HS - Lob ATGAGCCATCCTGACTACAGAATGAACCTCCGGCCCCTGGGGACCCCCAGAGGTGTGTCT 

10 20 30 40 50 60 

70 . 80 90 100 110 120 

MM- LOB TCTGTGGTTGGCCCGAGTGCTGTTGGTGCTTCGCCAGGTGACAAAAAGTCAAAGAACAAG 



HS - Lob GCTGTGGCTGGTCCACATGACATTGGTGCTTCGCCAGGTGACAAAAAGTCAAAGAACAGG 

70 80 90 100 110 120 

130 140 150 160 170 180 

MM - LOB T C CATGCGAGGGAAGAAAAAGAGCATATTTGAAACCTACATGT C CAAGGAGGATGTTTCA 



HS - Lob TCCACACGAGGGAAGAAAAAGAGCATATTTGAAACTTACATGTCCAAGGAGGATGTTTCA 

130 140 150 . 160 170 180 

190 200 210 220 230 240 

MM- LOB GAAGG CTTGAAGAGAGGAACACTTAT C CAGGGTGTATTGAGAAT CAAC C CAAAGAAGTTT 



HS - Lob GAAGG CTTGAAGAGAGGAACACT CAT C CAGGGTGTATTGAGAATTAATC CAAAGAAGTTT 

190 200 210 220 230 240 

250 260 270 280 290 300 

MM- LOB CATGAAGCCTTCATTCCTTCTCCGGATGGTGATCGGGACATTTTTATTGATGGAGTTGTT 



HS - Lob CATGAAGCCTTCATTCCTTCCCCGGATGGTGATCGAGACATTTTTATTGATGGGGTTGTT 

250 260 270 280 290 300 

310 320 330 340 350 360 

MM- LOB GCTCGTAATAGAGCCTTAAATGGGGACCTTGTGGTTGTAAAACTGCTTCCTGAGGATCAG 



HS - Lob GCTCGTAATAGAGCCTTAAATGGGGATCTGGTGGTCGTGAAACTGCTTCCCGAGGAGCAT 

310 320 330 340 350 360 

370 380 390 400 410 420 

MM- LOB TGGAAGG CAGTTAAA C CAGAGAGCAATGA CAAAGAAATAGAAG CTACTTATGAAG CTGAC 



HS - Lob TGGAAGGTAGTTAAAC CAGAGAGCAATGA CAAAGAAA CAGAAG CTGCGTATGAAT CAGAT 

370 380 390 400 410 420 

430 440 450 460 470 

MM- LOB ATCCCTGAAGAGGGCTGTGGACATCACCCCCTGCAGCAGTCCCGGAAAGGCTGGAGTG- - 



HS - Lob ATCCCCGAGGAGCTCTGTGGACACCATCTCCCGCAACAGTCCCTGAAAAGCTATAATGAC 

430 440 450 460 470 480 

480 490 500 510 520 530 

MM- LOB - GTCCTGATGTCATTATAGAGGCTCAGTTTGATGACAGCGACTCAGAAGATAGACATGGC 



HS - Lob AGTCCTGATGTCATTGTAGAGGCTCAGTTTGATGGCAGCGACTCAGAAGATGGACATGGC 

490 , 500 510 520 530 540 



540 550 560 570 580 590 

MM- LOB AACACC AGTGGCCTGGTTGATGGTGTTAAGAAATTGTCAATCTCTACTCCTGACAGA 

* ■»* * mm ■ *■•*■**■••■«*»*■***• • •*a s L* * « * * * • * * * 

• • # • • mm • + + + ■ * * * m • * « ■ • * ♦ * * 

HS - Lob ATCACACAAAATGTGCTGGTTGATGGTGTTAAGAAACTCTCAGTTTGTGTTTCTGAGAAA 

550 560 570 580 590 600 

600 610 620 630 640 650 

MM- LOB GGAAAAGAAGATTCTAGTACTC CAG TTATGAAAGATGAGAA CACCCC CATAC CACAGGAC 

• •** • • * * + + » « * * * * * * * + * * * » 4 * • * * * 

■ • • * ♦ • » • » » * * • «* * * a * * * • * » • • 

HS - Lob GGAAGAGAGGATGGTGATGCACCGGTTACAAAAGATGAGACCACCTGCATTTCACAAGAC 

610 620 630 640 650 660 

660 670 680 690 700 710 

MM - LOB ACAAGAGGCTTATCAGAGAAGTCACTTCAGAAATCAGCAAAGGTGGTTTACATCTTGGAG 



HS-Lob ACAAGAGCTTTATCGGAGAAATCCCTGCAAAGATCAGCAAAGGTGGTTTACATCTTGGAG 

670 680 690 700 710 720 

720 730 740 750 760 770 

MM- LOB AAAAAGCATTCTCGAGCAGCAACTGGCATCCTGAAACTCTTGGCTGATAAGAACAGTGAC 



HS - Lob AAAAAACATTCTCGAGCAGCAACCGGCTTCCTCAAACTCTTGGCTGATAAGAACAGCGAA 

730 740 750 760 770 780 

780 790 800 810 820 830 

MM- LOB CTGTTTAAGAAATACGCCCTGTTTTCTCCTTCAGACCACCGAGTACCTAGAATTTACGTA 



HS - Lob CTGTTTAGGAAATACGCCCTGTTTTCTCCCTCAGACCACCGAGTGCCTAGAATTTATGTG 

790 800 810 820 830 840 

840 850 860 870 880 890 

MM- LOB CCTCTCAAGGACTGTCCCCAGGACTTCATGACCCGACCTAAAGACTTTGCCAACACGCTG 



HS - Lob CCTCTCAAGGACTGTCCCCAGGACTTTGTGGCACGGCCTAAAGATTATGCCAACACACTG 

850 860 870 880 890 900 

900 910 920 930 940 950 

MM- LOB TTCATCTGCCGCATCATAGACTGGAAGGAGGACTGTAATTTTGCCCTGGGGCAACTGGCT 



HS - Lob TTCATCTGCCGCATTGTGGACTGGAAGGAGGACTGCAATTTTGCCCTGGGGCAGCTGGCT 

910 920 930 940 ^950 960 

960 970 980 990 1000 1010 

MM -LOB AAGAGTCTTGGGCAGGCTGGTGAAATCGAGCCTGAAACAGAAGGGATACTGACAGAATAT 



HS - Lob AAGAGTCTTGGGCAGGCTGGTGAAATTGAGCCTGAAACAGAAGGAATACTAACAGAGTAT 

970 980 990 1000 1010 1020 

1020 1030 1040 1050 1060 1070 

MM- LOB GGTGTGGACTTCTCTGATTTCTCTTCAGAAGTTCTTGAATGTCTCCCTCAAAGCCTGCCC 



HS - Lob GGCGTGGATTTCTCTGATTTCTCTTCAGAAGTTCTAGAATGTCTTCCTCAAGGCCTGCCA 

1030 1040 1050 1060 1070 1080 



1080" 1090 1100 1110 1120 1130 

MM- LOB TGGACAATCC CACCTGATGAGGTGGGCAAGAGAAGAGACCTAAGGAAAGACTGTAT CTTC 



HS -Lob TGGACAATTCCACCAGAGGAGTTCAGCAAGAGAAGGGATTTAAGAAAAGACTGTATCTTC 

1090 1100 1110 1120 1130 1140 

1140 1150 1160 1170 1180 1190 

MM- LOB ACCATTGATCCATCAACTGCTCGCGACCTTGATGATGCCCTCGCCTGCAGGCGGCTCACT 



HS - Lob ACCATTGACCCATCAACCGCCCGAGACCTCGATGATGCCCTCTCCTGCAAGCCACTCGCT 

1150 1160 1170 1180 1190 1200 

1200 1210 1220 1230 1240 1250 

MM- LOB GATGGCACCTTCGAAGTGGGCGTCCACATCGCCGATGTGAGTTACTTTGTTCCTGAGGGA 



HS - Lob GACGGCAACTTCAAAGTGGGAGTTCACATTGCTGACGTGAGTTACTTTGTTCCGGAGGGA 

1210 1220 1230 1240 1250 1260 

1260 1270 1280 1290 1300 1310 

MM- LOB TCCTCTTTGGATAAAGTAGCTGCTGAGAGAGCCACAAGTGTCTACTTGGTCCAGAAGGTG 



HS - Lob TCTGATCTGGATAAAGTGGCTGCCGAGAGGGCTACAAGCGTCTACTTGGTTCAAAAGGTG 

1270 1280 1290 13-0 0 1310 1320 

1320 1330 1340 1350 1360 1370 

MM -LOB GTCCCCATGCTTCCCAGGCTTCTGTGTGAGGAACTCTGCAGCCTCAACCCCATGACTGAC 



HS - Lob GTCCCCATGCTTCCCAGGCTGCTGTGTGAGGAGCTGTGCAGCCTCAACCCCATGTCCGAC 

1330 1340 1350 1360 1370 1380 

1380 1390 1400 1410 1420 1430 

MM - LOB AAGCTGACCTTCTCTGTGATCTGGAAGCTGACCCCTGAAGGCAAGATCCTTGAAGAGTGG 



HS - Lob AAGCTGACCTTCTCTGTGATCTGGACACTGACTCCAGAGGGCAAGATCCTTGATGAATGG 

1390 1400 1410 1420 1430 1440 

1440 1450 1460 1470 1480 1490 

MM- LOB TTTGGCCGCACTATCATCCGTTCTTGCACCAAACTGAGCTACGACCATGCCCAGAGCATG 



HS - Lob TTTGGCCGGACCATCATCCGCTCCTGCACCAAACTTAGCTACGAGCATGCACAGAGCATG 

1450 1460 1470 1480 1490 1500 

1500 1510 1520 1530 1540 1550 

MM- LOB AT CGAAAAT C CAACTGAGAAGAT C C CTGAGGAAGAGCTT C CC C CAAT TT CT C CAGAGCAC 

• ♦ ***.**#*«*«* + m *•*«**» * • • + • *»*4*««* 

* • * + * » **••**-*#*** ******* * ******* ***** v * # ■ * ******** 

HS - Lob ATTGAAAGCCCAACTGAGAAAATCCCTGCGAAAGAGCTGCCCCCCATTTCCCCAGAGCAT 

1510 1520 1530 1540 1550 1560 

1560 1570 1580 1590 1600 1610 

MM -LOB AGCGTCGAGGAGGTGCACCAGGCAGTCCTGAACCTGCACAGCATTGCAAAGCAACTCCGC 



HS - Lob AGCAGCGAGGAGGTACAC CAGGCCGTCTTGAATCTC CACGGAATTG C CAAG CAGTTACGC 

1570 1580 .1590 1600 1610 1620 

1620 1630 1640 1650 1660 1670 

MM- LOB CGCCAGCGCTTTGTAGATGGCGCACTCCGTTTAGATCAGCTGAAGCTTGCTTTTACTCTG 
-> . . ...... 

HS - Lob CAGCAGCGCTTTGTGGACGGCGCACTTCGTTTGGATCAGCTAAAGCTTGCTTTCACTCTG 

1630 1640 1650 1660 1670 1680 



1680 1690 1700 1710 1720 1730 

MM -LOB GACCATGAGACTGGACTGCCTCAAGGATGTCACATCTATGAGTACCGAGACAGCAACAAG 

HS - Lob GACCACGAGAC CGGAT TGC CTCAAGGATGT CATAT CTATGAGTAC GGCGAGAGCAACAAG 

1690 1700 1710 1720 1730 1740 

1740 1750 1760 1770 1780 1790 

MM- LOB CTTGTAGAGGAGTTCATGCTCCTGGCCAACATGGCGGTGGCCCACAAGATCTTCCGCACC 



HS - Lob CTCGTGGAGGAGTTCATGCTCTTGGCCAACATGGCAGTGGCCCACAAGATCCACCGCGCC 

1750 1760 1770 1780 1790 1800 

1800 1810 1820 1830 1840 1850 

MM- LOB TTCCCTGAGCAGGCCCTGCTGCGCCGGCATCCCCCACCACAGACGAAGATGCTCAGTGAC 



HS - Lob TTCCCCGAGCAGGCCCTGCTGCGCCGGCACCCCCCGCCCCAAACAAGGATGCTCAGTGAC 

1810 1820 1830 1840 1850 1860 

1860 1870 1880 1890 1900 1910 

MM- LOB CTGGTGGAGTTCTGTGACCAGATGGGGCTGCCCATGGATGTCAGCTCTGCAGGGGCCCTA 



HS - Lob CTGGTGGAATTCTGCGACCAGATGGGGCTGCCCGTGGACTTCAGCTCCGCAGGAGCCCTC 

1870 1880 1890 1900 1910 1920 

1920 1930 1940 1950 . 1960 1970 

MM- LOB AATAAAAGCCTGACTAAGACATTTGGAGATGACAAGTACTCTCTGGCCCGGAAGGAGGTG 



HS - Lob AATAAAAGCCTGACCCAAACATTTGGAGATGACAAGTACTCACTGGCCCGCAAGGAGGTG 

1930 1940 1950 1960 1970 1980 

1980 1990 2000 2010 2020 2030 

MM- LOB CTCACCAACATGTACTCCCGGCCCATGCAGATGGCACTGTACTTCTGCTCTGGGATGCTG 



HS - Lob CTCACCAACATGTGCTCCCGGCCCATGCAGATGGCACTGTACTTCTGCTCGGGGCTGCTG 

1990 2000 2010 2020 2030 2040 

2040 2050 2060 2070 2080 2090 

MM- LOB CAGGACCAGGAGCAGTTCCGGCATTATGCTCTCAACGTTCCCCTCTACACACACTTCACC 



HS - Lob CAGGACCCAGCGCAGTTCCGGCACTACGCGCTCAATGTGCCCCTGTACACACACTTCACC 

2050 2060 2070 2080 2090 2100 

2100 2110 2120 2130 2140 2150 

MM -LOB TCTCCCATCCGCCGCTTTGCTGACGTCATAGTGCACCGCCTCCTGGCTGCTGCTCTGGGC 



HS - Lob TCGCCCATCCGCCGCTTTGCCGACGTCCTGGTGCACCGCCTCCTGGCTGCCGCGTTAGGC 

2110 2120 2130 2140 2150 2160 

2160 2170 2180 2190 2200 2210 

MM- LOB TACAGTGAACAGCCAGATGTGGAGCCTGATACCCTACAGAAGCAAGCTGACCACTGCAAT 



HS - Lob TATAGGGAGCGACTAGACATGGCGCCCGATACCCTGCAGAAACAGGCGGACCACTGTAAC 

2170 2180 2190 2200 2210 2220 



2220 2230 2240 2250 2260 2270 

MM- LOB GACCGTCGCATGGCTTCCAAACGTGTGCAGGAGCTCAGCATCGGCCTCTTCTTCGCAGTT 



HS - Lob GACCGCCGCATGGCGTCCAAGCGCGTGCAGGAGCTCAGTACCAGTCTCTTCTTTGCTGTT 

2230 2240 2250 2260 2270 2280 

2'280 2290 2300 2310 2320 2330 

MM- LOB CTAGTAAAGGAGAGTGGCCCCCTGGAGTCCGAAGCCATGGTGATGGGTGTCCTGAACCAA 



HS - Lob CTGGTCAAGGAGAGTGGC C C C CTGGAGT C AGAAGCCATGGTGATGGG CAT C C TGAAGCAA 

2290 2300 2310 2320 2330 2340 

2340 2350 2360 2370 2380 2390 

MM- LOB GCTTTCGACGTGCTGGTGCTGCGCTTTGGGGTGCAGAAGCGCATCTACTGCAATGCACTG 



HS - Lob GCCTTCGACGTGCTGGTGCTGCGCTACGGCGTGCAGAAGCGCATCTACTGCAACGCACTG 

2350 2360 2370 2380 2390 2400 

2400 2410 2420 2430 2440 2450, 

MM- LOB GCCCTGCGATCCTACAGCTTCCAGAAGGTGGGGAAGAAGCCAGAGCTCACTCTTGTTTGG 



HS-Lob GCCCTGCGGTCCCACCACTTCCAGAAGGTGGGCAAGAAGCCGGAACTCACGCTGGTCTGG 

2410 2420 2430 2440 2450 2460 

2460 2470 2480 2490 2500 2510 

MM- LOB GAGCCTGATGACCTTGAAGAGGAGCCAACACAGCAGGTCATCACCATCTTCAGCCTGGTG 



HS - Lob GAGCCTGAGGACATGGAGCAGGAGCCAGCACAGCAGGTCATCACCATCTTCAGCCTGGTG 

2470 2480 2490 2500 2510 2520 

2520 2530 " 2540 2550 2560 2570 

MM- LOB GATGTGGTCCTGCAGGCAGAGGCCACAGCCCTCAAGTACAGTGCTATCCTGAAGCGACCA 



HS - Lob GAGGTGGTCCTGCAGGCAGAGTCCACAGCCCTCAAGTACAGCGCCATCCTGAAGCGGCCA 

2530 2540 2550 2560 2570 2580 

2580 2590 2600 

MM -LOB GGC CTGG AGAAGGCGTCTGATGAGGAGCCTGAG 



HS - Lob GGCACCCAGGGCCACCTGGGCCCTGAGAAGGAGGAGGAGGAGTCTGACGGTGAGCCCGAG 

2590 2600 2610 2620 2630 2640 

2610 
MM- LOB GACTGA 

« * * * * 
■ • ■ * • 

HS-Lob GACTCA 



i 



TABLE 1 



Blast of complete mouse LOBO protein against translated GenBank produced hits with 
significant similarity with the following DB-entries (Blast performed 30-Jan-2003): 

All other matches were significantly lower than 70 % aa identity 

RESULT: no entry with significant (i.e. >70 % aa identity) was older than May 2002 ! ! 



Similarity 


Accession 


Entry date 


Remark 


818/870(94%) 


NP_705758.1 


26-JAN-2003 




801/831 (96%) 


BAC2 7292.1 


05-DEC-2002 




720/772 (93%) 


XP_129937.2 


16-NOV-2002 


version 1 : 08-OCT-2002 


647/663 (97%) 


XP_1 103 18.2 


15-NOV-2002 


version 1: 30-APR-2002 


569/590 (96%) 


BAC2 6549.1 


05-DEC-2002 




451/532 (84%) 


AAH36113.1 


23-SEP-2002 




410/527 (77%) 


BAC04324.1 


15-JUL-2002 




167/188 (88%) 


BAC03400.1 


15-JUL-2002 




119/135(88%) 


AAH30113.1 


20-MAY-2002 




117/128 (91%) 


XP_23 7349.1 


28-JAN-2003 





TABLE 2 

The detailed Blast results are as follows: 



BLASTP 2.2.5 [version from Nov-1 6-2002] 
Reference: 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

RID: 1043949663-07491-641 

Query= MM-LOBO 871 bp PROT 6-NOV-1998, 
871 bases, 670 checksum. 
(870 letters) 



Database: All non-redundant GenBank CDS 
translations-fPDB+SwissProt+PIR+PRF 

1,321,324 sequences; 423,161,544 total letters 













Score 




E 


Sequences producing significant alignments: 


(bits) 


Value 


gi 


24233556 


ref 


NP 705758.1 


| RIKEN cDNA 4930429A22; hypo the . . . 


1640 


0 


. 0 


gi 


26327097 


dbj 


BAC27292 . 1 | 


unnamed protein product [Mus mu. . . 


1605 


0 


. 0 


gi 


25047588 


ref 


XP_129937 .2 


hypothetical protein MGC37640 . . . 


1453 


0 


. 0 


gi 


25019857 


ref 


XP_110318 .2 


similar to hypothetical protei... 


1303 


0 


. 0 


gi 


26325590 


dbj 


BAC26549.l| 


unnamed protein product [Mus mu. . . 


1166 


0 


. 0 


gi 


23271317 


gb|AAH3 6113 .1 | 


Unknown (protein for MGC:33943) ... 


916 


0 


. 0 


gi 


21753324 


dbj (BAC043 24 . 1 | 


unnamed protein product [Homo s . . . 


812 


0 


. 0 


gi 


21292324 


gb| EAA04469 . 1 | 


agCP3212 [Anopheles gambiae str. . . . 


424 


e 


-117 


gi 


19115422 | 


ref 


NP_5 94 510 . 1 


ribonuclease II RNB family pro. . . 


418 


e 


-115 


gi 


15220899 


ref 


NP_177891 . 1 


putative 3* -5' exoribonuclease . . . 


414 


e 


-114 



gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 



25406521 
25332397 
24654592 
19922976 
15292611 
18488261 
21748526 
26327903 
13446610 



pir 
pir 
ref 
ref 



E96806 
A84553 



NP 
NP 



728490 
612012 



hypothetical protein T32E8.1 [impor. 
probable mitotic control protein di . 

[Drosophila melanog. 
[Drosophila melanog 



1 
1 



CGI 6 94 0 -PC 
CGI 6 94 0 -PA 



gb | AAK93574 . 1 | SD10981p [Drosophila melanogaster] 
XP_081229.l| Dis3 [Drosophila melanogaster].. 



ref 
dbj 
dbj 
emb 



BAC03400 . 1 
BAC2 76 92 . 1 
CAC35051 . 1 



52 62619 | emb | CAB4 574 9.1 | 
2 77033 87 | ref | XP_22444 9 . 1 
19923416 | ref |NP__055768 .2 
6324552 | ref 



21297331 
14250908 
19113445 
15021874 
19115966 
18916779 
27369724 
14250916 
15559519 
23136616 
19173030 



gb 
emb 
ref 
dbj 
ref 
dbj 
ref 
emb 
gb 
gb 
ref 



NP_014621. 1 | 
EAA09476 . 1 | 
CAC39259. l| 
NP_596653 . 1 
BAB62212 . 1 | 
NP_588616 . 1 
BAB8554l.l| 
NP_766107 . 1 
CAC39263 . 1 



7493807 | pir | 



22748821 
27685983 



ref 
ref 



15616115 
23099883 
21673344 
15672924 
21754656 
23020108 
16801605 
16804487 
21397569 
16080414 
15894003 
27467483 
24379988 
19113103 
21282471 
15923770 
27685919 



ref 
ref 
ref 
ref 
dbj 
gb 



ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 



FLJ003 27 protein [Homo sapiens] 
unnamed protein product [Mus mu . . . 
putative exoribonuclease DIS3 [... 
hypothetical protein [Homo sapiens] 
similar to mitotic control pro... 
mitotic control protein dis3 h. . . 
Possible component of RCCl-Ran . . . 
ebiP3 7 04 [Anopheles gambiae str. . . . 
Rrp44p homologue [Trypanosoma b... 

mitotic control protein dis3 [... 
hypothetical protein [Macaca fa... 

hypothetical protein MGC4562 [... 
KIAA1955 protein [Homo sapiens] 

hypothetical protein 4932411M1... 
putative ribonuclease II-like p... 
AAH14124 . 1 | AAH14124 Unknown (protein for IMA... 
ZP 00118334.1 1 hypothetical protein [Cytopha... 
NP_597581.l| similarity to DIS3 PROTEIN (RN... 
T3 0524 protein phosphatase Ssdl homolog - y. . . 
NP_689596.1 hypothetical protein MGC42174 ... 
XP~237349.1 similar to hypothetical protei... 

Product of gene unknown; Ssdlp 
ribonuclease R; virulence-asso . . . 
ribonulease R [Oceanobacillus . . . 
ribonuclease II family protein .. . 
ribonuclease [Lactococcus lact... 
unnamed protein product [Homo s. . . 
hypothetical protein [Clostri... 
1 similar to exoribonuclease RNa . . . 
1 similar to exoribonuclease RNa. . . 
1 RNB , RNB-like protein [Bacillu. . . 
1 similar to hypothetical protei... 
1 FUSION ribonuclease and riboso. . , 
1 ribonuclease R [Staphylococcus.., 
1 putative exoribonuclease R {RN. . 
1 hypothetical protein; ribonucl . . 
1 ribonuclease R [Staphylococcus.. 
1 conserved hypothetical protein. . 
l| similar to hypothetical protei 



6320499 | ref |NP_0 10579 . 1 | 



NP_ 
NP 
NP~ 
NP 



244420 
^693349 
"661409 
'267098 



1 

1 
1 
1 



BAC04542 . 1 | 
ZP 00059814.1 



NP_ 
NP_ 
NP_ 
NP~ 
NP_ 
NP_ 
NP_ 
NP~ 
NP_ 
NP 
XP 



471873 
465972 
"653554 
J391241 
~347352 
^764120 
721943 
"596311 
"645559 
J371304 
"217466 



6473411 1 dbj |BAA87129.l| Hypothetical protein [Schizosacc 



15926457 
19745621 
15674608 
21909888 
25011590 
27685993 
22537625 
15902922 
15900852 
15673187 
15805382 
15605122 
15618415 
20807463 
23024105 
27380223 
15835291 
23053471 
20881074 



ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
ref 
gb 



NP_373990 . 1 I 
NP_606757 .1 I 
NP__268782 
NP_664156 
NP_735985 
XP_2 3 73 51 
NP_688476 
NP__3 584 72 
NP_3 4 54 56 . 1 
NP_2 673 61 . 1 
NP__2 94 0 76 
NP_219907 
NP_224700 
NP_622634 
ZP 00063328 



1 
1 
1 
1 
1 
1 



1 
1 
1 
1 



ref NP_771752 . 1 
ref NP_297050.1 
gb|ZP_00079680.1 
reflXP 127907.1 



ribonuclease R [Staphylococcus 
putative exoribonuclease R [St. 
putative exoribonuclease R [St. 
putative exoribonuclease R [St. 
Unknown [Streptococcus agalact . 
similar to hypothetical protei. 
exoribonuclease, VacB/Rnb fami. 
Exoribonuclease R [Streptococc . 
exoribonuclease, VacB/Rnb fami. 
ribonuclease [Lactococcus lact. 
ribonuclease [Deinococcus radi . 
Ribonuclease Family [Chlamydia, 
ribonuclease family [Chlamydop. 
Exoribonucleases [Thermoanaero . 

hypothetical protein [Leucono. 
exoribonuclease [Bradyrhizobiu . 
exoribonuclease-, VacB/Rnb fami . 

hypothetical protein [Geobact . 
RIKEN cDNA 2810028N01 [Mus mus . 
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ref |NP_214403 . 1 
ref |NP_603505.1 
ref |NP_562212 . 1 
gblZP 00070371.: 



ref 
ref 



20071792 |gb [AAH27357 . 1 | Similar to mitotic control prote . . 

vacB protein [Thermotoga marit . . 

hypothetical protein [Haemoph. . 
VacB protein (ribonuclease II . . 
Exoribonuclease II [Fusobacter. . 
ribonuclease R [Clostridium pe.. 

hypothetical protein [Oenococ , . 
ribonuclease R [Vibrio cholera,. 
PROBABLE EXORIBONUCLEASE RNASE . . 

hypothetical protein [Xylella.. 
ribonuclease R (RNase R) [Salm. . 

hypothetical protein [Nitroso. . 
putative enzyme [Escherichia c.. 
putative enzyme [Escherichia c.. 
Ribonuclease R (RNase R) (Vac. . 
391901 | dbj |BAA01777.l| ORF-2 [Shigella flexneri] 
24115534 | ref |NP_710044 . 1 | putative enzyme [Shigella flex 
27735249 
26553786 
27364704 
21242317 
15600130 
23003758 
26251071 



1564348 5 | ref | NP_22 8 53 1 . 1 
23 467 93 8 j gb | ZP__0 0 123 514 . 1 1 
15607021 
19703943 
18310278 
23038196 
15642594 
17545947 
| 22995402 
" 16763187 
22954374 
16132001 

15804768 _ 
731093 | sp|P214 99 |RNR_ECOLI 



NP__232227.l| 
NP_519349.l| 
gb|ZP_00039879.l| 
ref |NP__458804.l| 
gb|ZP_00002175.l| 
ref |NP_418600 . 1 
ref IMP 290809.1 



sp|P3085l|RNR_SHIFL Ribonuclease R (RNase R) (V. . 



ref 
ref 
ref 
ref 

gb| ZP_00047408 . 1 
ref |NP_757111 . 1 | 



NP_ 
NP~ 
NP 
NP~ 



757720 
"760232 
641899 
253624 



1 
1 
1 
1 



3* -5' exoribonuclease RNase R 
Exoribonuclease R [Vibrio vuln. . 
RNase R [Xanthomonas axonopodi . . 
exoribonuclease RNase R [Pseud. . 

hypothetical protein [Lactoba. . 
Ribonuclease R [Escherichia co . . 
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11359056 | pir | | T45283 growth polarity maintenance protein., 

Alignments 

RIKEN cDNA 4 93 042 9A22; hypothetical protein 
Unknown (protein for MGC: 37640) [Mus musculus] 



>gi | 24233 5 56 | ref |NP_7 05758 . 1 
MGC3 7640 [Mus musculus] 
gi | 23273 6 03 | gb | AAH3 6177 . 1 | 
Length = 8 70 



Score - 1640 bits (4247), Expect = 0.0 

Identities = 818/870 (94%) , Positives = 818/870 (94%) 

Query : 1 MNHPDYKLNLRXXXXXXXXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXIFETYMSKEDVS 6 0 

MNHPDYKLNLR D IFETYMSKEDVS 

Sbjct: 1 MNHPDYKLNLRSPGTPRGVSSWGPSAVGASPGDKKSKNKSMRGKKKSIFETYMSKEDVS 60 

Query : 61 EGLKRGTL I QGVLR I NPKKFHEAF I PS PDGDRD I F I DG VVARNRALNGDL WVKLL PEDQ 12 0 

EGLKRGTL I QGVLR INPKKFHEAF I P S PDGDRD I F IDGWARNRALNGDLWVKLLPEDQ 
Sbjct: 61 EGLKRGTL I QGVLR INPKKFHEAF I PS PDGDRD I F IDGVVARNRALNGDL VVVKLLPEDQ 12 0 

Query: 121 WKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQFDDSDSEDRHGN 180 

WKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQFDDSDSEDRHGN 
Sbjct: 121 WKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQFDDSDSEDRHGN 180 

Query: 181 TSGLTOGVKKLSISTPDRGKFlDSSTPvKKIJENTPIPQDTRGLSEKSLQKSAKVVYILEKK 240 

TSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKVVYILEKK 
Sbjct: 181 TSGLVT)GVKKLSISTPDRGKEDSSTPVTVIKX)ENTPIPQDTRGLSEKSLQKSAKVVYILEKK 240 

Query: 241 HSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFI 3 00 

HSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFI 
Sbjct: 241 HSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFI 300 

Query: 301 CRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWT 3 60 

CRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWT 
Sbjct: 301 CRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWT 360 

Query: 361 IPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSS 420 

IPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSS 
Sbjct: 361 IPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSS 420 



Query: 421 LDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFG 480 



LDKVAAERATSWLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFG 
Sbjct: 421 LDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKIiTPEGKILEEWFG 480 

Query: 481 RT I I R S CTKL S YDHAQSM I ENXXXXXXXXXXXXXXXXHS VE EVHQAVLNLHS I AKQLRRQ 540 

RTIIRSCTKLSYDHAQSMIEN HSVEEVHQAVLNLHS IAKQLRRQ 

Sbjct: 481 RTIIRSCTKLSYDHAQSMIENPTEKIPEEELPPISPEHSVEEVHQAVLNLHSIAKQLRRQ 540 

Query: 541 RFvDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLIJmMAVAHKIFRTFP 600 

RFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFP 
Sbjct: 541 RFVDGALRLDQLKIAlFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFP 600 

Query: 601 EQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLT 660 

EQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLT 
Sbjct: 601 EQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLT 660 

Query: 661 NMYSRPMQlVLkLYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADViraRLLAAALGYS 720 

NMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAAIiGYS 
Sbjct: 661 NMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYS 72 0 

Query: 721 E Q PD VE PDT LQKQADHCND RRMAS RR VQEL S I G L F FAVL VKE S G P L E S E AMVMGVLNQAF 780 

EQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGVLNQAF 
Sbjct: 721 EQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGVLNQAF 78 0 

Query: 781 DVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLWEPDDLEEEPTQQVITIFSLVDV 84 0 

DVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVITIFSLVDV 
Sbjct: 781 DVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLWEPDDLEEEPTQQVITIFSLVDV 840 

Query: 841 VLQAEATALKYSAILKRPGLEKASDEEPED 870 

VLQAEATALKYSAILKRPGLEKAS DEEPED 
Sbjct: 841 VLQAEATALKYSAILKRPGLEKASDEEPED 870 

>gi | 26327097 | dbj | BAC27292 . 1 | unnamed protein product [Mus musculus] 
Length =831 

Score = 1605 bits (4155), Expect = 0.0 

Identities = 801/831 (96%) , Positives - 801/831 (96%), Gaps = 14/831 (1%) 

Query: 54 MSKEDVSEGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNGDLWV 113 

MS KEDVS EGLKRGTL I QGVLR INPKKFHEAF IPS PDGDRD I FIDGWARNRALNGDL WV 
Sbjct: 1 MS KEDVS EGLKRGTL I QGVLR I NP KKFHEAF I PS PDGDRD I F IDG WARNRALNGDL WV 60 

Query: 114 KLLPEDQWK AVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWS 159 

KLLPEDQWK AVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWS 
Sbjct : 61 KLLPEDQWKPRITLSLPGVLGLQAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWS 120 

Query: 160 GPDVIIEAQFDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDT 219 

GPDVIIEAQFDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDT 
Sbjct: 121 GPDVIIEAQFDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDT 180 

Query: 220 RGLSEKSLQKSAKWYILEKKHSRAATGILKLLADK3STSDLFKKYALFSPSDHRVPRIYVP 279 

RGLSEKSLQKSAKVVYILEKXHSRAATGILKliLAJDKNSDLFKXYALFSPSDHRVPRIYVP 
Sbjct: 181 RGLSEKSLQKSAKVWILEKKHSRAATGILKJiLADKNSDLFKKYALFSPSDHRVPRIYVP 240 

Query: 2 80 LKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYG 33 9 

LKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYG 
Sbjct: 241 LKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYG 300 

Query: 340 VDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTD 399 

VDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTD 
Sbjct: 301 VDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTD 360 

Query: 400 GTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDK 459 

GTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDK 
Sbjct: 361 GTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDK 420 



Query: 460 . LTFSVIWKXTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHS 519 

LTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIEN HS 
Sbjct: 421 LTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENPTEKIPEEELPPISPEHS 480 

Query: 520 VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKL 5 79 

VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKL 
Sbjct: 481 VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKL 54 0 

Query: 580 VEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALN 63 9 

VEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDIiVEFCDQMGLPMDVSSAGALN 
Sbjct: 541 VEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALN 600 

Query: 640 KSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALMVPLYTHFTS 699 

KSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTS 
Sbjct: 601 KSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTS 660 

Query: 700 P I RRFAD VI VHRL LAAALG YS EQ PD VE PDTLQKQADHCNDRRMAS KRVQE L S I GL F FAVL 759 

PIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVL 
Sbjct: 661 PIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVL 720 

Query: 760 VKESGPLESEAIVrWGVLNQAFDVLVLRFGVQK^IYCNALALRSYSFQKVGKKPELTLVWE 819 

VKESGPLESEAMVMGVLNQAFDVLVLRFGVQK^IYCNAIALRSYSFQKVGKKPELTLVWE 
Sbjct: 721 VKESGPLESEAMVMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWE 780 

Query: 820 PDDLEEEPTQQVITIFSLVDVVLQAEATALKYSAILKRPGLEKASDEEPED 870 

PDDLEEEPTQQVITIFSLVDWLQAEATALKYSAILKRPGLEKASDEEPED 
Sbjct: 781 PDDLEEEPTQQVITIFSLVDWLQAEATALKYSAILKRPGLEKASDEEPED 831 

>gi | 25047588 | ref |XP_129937 . 2 | hypothetical protein MGC37640 [Mus musculus} 
Length = 819 

Score = 1453 bits (3762), Expect = 0.0 

Identities = 720/772 (93%), Positives = 720/772 (93%) 

Query : 1 MNHPDYKLNLRXXXXXXXXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXI FETYMS KEDVS 6 0 

MNHPDYKLNLR D I FETYMS KEDVS 

Sbjct: 1 MNHPDYKLJSTLRSPGTPRGVSSWGPSAVGAS PGDKKSKNKSMRGKKKS I FETYMS KEDVS 60 

Query: 61 EGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNGDLVWKLLPEDQ 120 

EGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGVVARNRALNGDLVVVKLLPEDQ 
Sbjct : 61 EGLKRGT L I QGVLR I NPKKFHEAF IPS PDGDRD I F I DGWARNRALNGD L VWKLL P EDQ 120 

Query: 121 WKAVKPESNDKEIEATYEADI PEEGCGHHPLQQSRKGWSGPDVI IEAQFDDSDSEDRHGN 180 

WKAVKPESNDKEIEATYEADI PEEGCGHHPLQQSRKGWSGPDVI IEAQFDDSDSEDRHGN 
Sbjct: 121 WKAVKPESNDKEIEATYEADI PEEGCGHHPLQQSRKGWSGPDVI IEAQFDDSDSEDRHGN 180 

Query: 181 TSGLVDGVKKX.SISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKVVYILEKK 240 

TSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKWYILEKK 
Sbjct: 181 TS GLVDGVKKLS I S T PDRGKEDS S T PVMKDENTP I PQDTRGLS EKS LQKS AKWY I LEKK 240 

Query: 241 HSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFI 300 

HSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFI 
Sbjct: 241 HSRAATGILKLIADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFI 300 

Query: 301 CRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWT 360 

CRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWT 
Sbjct: 30T CRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWT 360 

Query: 361 IPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSS 42 0 

IPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSS 
Sbjct: 361 I PPDEVGKRRDLRKDC I FT IDPSTARDLDDALACRRLTDGTFEVGVHI ADVS YFVPEGS S 420 

Query: 421 LDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFG 480 

LDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFG 
Sbjct: 421 LDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFG 480 



V 



Query: 481 RTIIRSCTKLSYDHAQSMIEISrXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQ 540 

RTIIRS CTKLS YDHAQ SM I EN HS VEE VHQAVLNLHS I AKQLRRQ 

Sbjct: 481 RTIIRSCTKLSYDHAQSMIENPTEKIPEEELPPISPEHSVEEVHQAVLNLHSIAKQLRRQ 540 

Query: 541 RFVDGALRLDQLKXAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLIiANMAVAHKIFRTFP 600 

RFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFP 
Sbjct: 541 RFVDGALRLDQLKIAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFP 600 

Query: 601 EQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLT 6 60 

EQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLT 
Sbjct: 601 EQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLT 660 

Query: 661 NMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYS 720 

NMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYS 
Sbjct: 661 NMYSRPMQMALYFCSGMLQDQEQFRHYALOTPLYTHFTSPIRRFADVIVHRLLAAALGYS 720 

Query: 721 EQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMV 772 

EQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMV 
Sbjct: 721 EQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMV 772 

>gi|25019857|ref |XP_110318.2 | similar to hypothetical protein MGC37640 [Mus 
musculus] 

Length =6 63 
Score = 1303 bits (3371), Expect = 0.0 

Identities = 647/663 (97%), Positives = 647/663 (97%) 

Query: 208 MKDENTPIPQDTRGLSEKSLQKSAKVVYILEKXHSRAATGILKLLADKNSDLFKKYALFS 267 

MKDENTPIPQDTRGLSEKSLQKSAKWYILEKXH^ 
Sbjct: 1 MKDENTPIPQDTRGLSEKSLQKSAKVVYILEKXHSRAATGILKLLMKNSDLFKKYALFS 60 

Query: 268 PSDHRVPRIYVPLKDCPQDFMTRPPCDFAISrTLFICRIIDWKEDCNFALGQLAKSLGQAGEI 327 

PSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEI 
Sbjct : 61 PSDHRVPRIYVPLKDCPQDFMTRPKDFANTIaFICRI IDWKEDCNFALGQLAKSLGQAGEI 120 

Query: 328 EPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARD 3 87 

EPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARD 
Sbjct: 121 EPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARD 180 

Query: 388 LDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLC 447 

LDDAIiACRRLTDGTFEVGWIADVSYFVPEGSSLDKVAAERATSVYLVQKVVPMLPRLLC 
Sbjct: 181 LDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLC 240 

Query: 448 EELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXX 507 

EELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIEN 
Sbjct: 241 EELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENPTEKIP 300 

Query: 50 8 XXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQG 567 

HSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQG 
Sbjct: 3 01 EEELPPISPEHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQG 3 60 

Query: 56 8 CHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMG 627 

CHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMG 
Sbjct: 361 CHIYEYRDSNKLVEEFMLLAN1V1AVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMG 420 

Query: 628 LPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHY 687 

LPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHY 
Sbjct: 421 LPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHY 480 

Query: 688 ALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRV 747 

AL1WPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRV 
Sbjct: 481 AL]WPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRV 540 

Query: 748 QELSIGLFFAVLVKESGPLESEAMVMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQK 807 



QELSIGLFFAVLVKESGPLESEAMVMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQK 
Sbjct: 541 QELSIGLFFAVLVKESGPLESEAMVMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQK 600 

Query: 8 08 VGKKPELTLWEPDDLEEEPTQQVITIFSLVDWLQAEATALKYSAILKRPGLEKASDEE 867 

VGKKPELTLVWEPDDLEEEPTQQVITIFSLVDWLQAEATALKYSAILKRPGLEKASDEE 
Sbjct: 601 VGKKPELTLVWEPDDLEEEPTQQVITIFSLVDWLQAEATALKYSAILKRPGLEKASDEE 660 

Query: 868 PED 870 

PED 

Sbjct: 661 PED 663 

>gi | 26325590 | dbj | BAC26549 . 1 | unnamed protein product [Mus musculus] 
Length = 684 

Score = 1166 bits (3016), Expect = 0.0 

Identities = 569/590 (96%) , Positives = 570/590 (96%) 

Query: 150 PLQQSRKGWSGPDVIIEAQFDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMK 209 

P +RKGWSGPDVIIEAQFDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMK 
Sbjct: 14 PCS S ARKGWS G PDVI I EAQFDD S D S EDRHGNTS GLVDGVKKL S I S T PDRGKED S S T P VMK 73 

Query: 210 DEMTPIPQDTRGLSEKSLQKSAKVVYILEKKHSRAATGlLKiLADKNSDLFKKYALFSPS 269 

DENTPIPQDTRGLSEKSLQKSAKVVYILEKKHSRAATGILKLLADK^ 
Sbjct: 74 DENTPIPQDTRGLSEKSLQKSAKVVYILEKKHSRAATGILKLLADKNSDLFKKYALFSPS 13 3 

Query: 270 DHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEP 329 

DHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEP 
Sbjct: 134 DHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGE1EP 193 

Query: 33 0 ETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLD 3 89 

ETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLD 
Sbjct: 194 ETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLD 253 

Query: 39 0 DALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEE 44 9 

DALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEE 
Sbjct: 254 DALACRRLTDGTFEVGVHIADVSYFVPEGSSLDECVAAERATSVYLVQECVVPMLPRLLCEE 313 

Query: 450 LCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXX 509 

LCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIEN 
Sbjct: 314 LCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENPTEKIPEE 373 

Query: 510 XXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCH 569 

HSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCH 
Sbjct: 374 ELPPISPEHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCH 433 

Query: 570 IYEYRDSNKXVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLP 629 

IYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLP 
Sbjct : 434 IYEYRDSNKLVEEFMLLA^MAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLP 4 93 

Query : 63 0 MDVSSAGALNKSLTKTFGDDKYSLARKEVLTMMYSRPMQMALYFCSGMLQDQEQFRHYAL 689 

MDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALi 
Sbjct: 494 MDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYAL 553 

Query: 690 OTPLYTHFTS PI RRFADVI VHRLLAAALGYSEQPDVEPDTLQKQADHCND 739 

NVPLYTHFT S P I RRFADVI VHRLLAA LGYSEQPDVEPDTLQKQADHCND 
Sbjct: 554 NVPLYTHFTS PI RRFADVI VHRLLAAGLGYSEQPDVEPDTLQKQADHCND 603 

>gi | 23271317 | gb | AAH36113 . 1 | Unknown (protein for MGC : 33943) [Homo sapiens] 
Length - 612 

Score = 916 bits (2368), Expect = 0.0 

Identities = 451/532 (84%), Positives = 476/532 (89%), Gaps = 2/532 (0%) 

Query: 4 9 IFETYMSKEDVSEGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNG 108 

IFETYMSKEDVSEGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNG 



Sbjct: 42 IFETYMSK£DVSEGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNG 101 

Query: 10 9 DLVWKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWS-GPDVIIEA 167 

. DL WVKLLPE + WK VKPESNDKE EA YE+DIPEE CGHH QQS K ++ PDVI+EA 
Sbjct: 102 DLWVKLLPEEHWKWKPESNDKETEAAYESDIPEELCGHHLPQQSLKSYNDSPDVIVEA 161 

Query: 168 QFDDSDSEDRHGNTSG-LVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKS 226 

QFD SDSED HG T LVDGVKKLS+ ++G+ED PV KDE T I QDTR LSEKS 
Sbjct: 162 QFDGSDSEDGHGITQNVLVDGVKKLSVCVSEKGREDGDAPVTKDETTCISQDTRALSEKS 221 

Query: 22 7 LQKSAKVVYILEKKHSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQD 286 

LQ+SAKWYILEKKHSRAATG LKLLADKNS+LF+KYALFSPSDHRVPRIYVPLKDCPQD 
Sbjct: 222 LQRSAKVWILEKXHSRAATGFLKLLADKNSELFRKYALFSPSDHRVPRIYVPLKDCPQD 281 

Query: 287 FMTRPKDFANTLFICRI IDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFS 346 

F+ RPKD+ANTLFICRI+DWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFS 
Sbjct: 282 FVARPKDYANTLFICRIVDWfCEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFS 341 

Query: 347 SEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGV 4 06 

SEVLECLPQ LPWTIPP+E KRRDLRKDCIFTIDPSTARDLDDAL+C+ L DG F+VGV 
Sbjct: 342 SEVLECLPQGLPWTIPPEEFSKRRDLRKDCIFTIDPSTARDLDDALSCKPLADGNFKVGV 401 

Query: 407 HIADVSYFVPEGSSLDKVAAERATSWLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIW 466 

HIADVSYFVPEGS LDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPM+DKLTFSVIW 
Sbjct: 402 HIADVSYFVPEGSDLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMSDKLTFSVIW 461 

Query: 467 KLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQA 526 

LTPEGKIL+EWFGRTIIRSCTKLSY+HAQSMIE+ HS EEVHQA 

Sbjct: 462 TLTPEGKILDEWFGRTIIRSCTKLSYEHAQSMIESPTEKIPAKELPPISPEHSSEEVHQA 521 

Query: 527 VLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNK 578 

VLNLH IAKQLR+QRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYR+SNK 
Sbjct: 522 VLNLHGIAKQLRQQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRESNK 573 

>gi | 21753324 | dbj | BAC04324 . 1 1 unnamed protein product [Homo sapiens] 
Length = 603 

Score = 812 bits (2098), Expect = 0.0 

Identities = 410/527 (77%), Positives = 436/527 (82%), Gaps = 2/527 (0%) 

Query : 1 MNHPDYKLNLRXXXXXXXXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXIFETYMSKEDVS 6 0 

M+HPDY+ +NLR D IFETYMSKEDVS 

Sbjct : 1 MSHPDYRMNLRPLGTPRGVSAVAGPHDIGAS PGDKKS KNRSTRGKKKS IFETYMSKEDVS 6 0 

Query: 61 EGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNGDLVWKLLPEDQ 120 

EGLKRGTL I QGVLRINPKKFHEAF IPS PDGDRD I F I DGWARNRALNGDLVWKLLPE + 
Sbjct : 61 EGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNGDLVWKLLPEEH 12 0 

Query: 121 WKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWS-GPDVIIEAQFDDSDSEDRHG 179 

WK VKPESNDKE EA YE +D I PEE CGHH QQS K ++ PDVI+EAQFD SDSED HG 
Sbjct: 121 WKVVKPESNDKETEAAYESDIPEELCGHHLPQQSLKSYNDSPDVIVEAQFDGSDSEDGHG 180 

Query: 180 NTSG-LVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKVVYILE 238 

T LVDGVKKLS-f- ++G+ED PV KDE T I QDTR LSEKSLQ+SAKWYILE 
Sbjct: 181 ITQOTLVDGVKKLSVCVSEKGREDGDAPVTIODETTCISQDTRALSEKSLQRSAKVVYILE 24 0 

Query: 239 KKHSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTL 298 

KKHSRAATG LKLLADKNS+LF+KYALFSPSDHRVPRIYVPLKDCPQDF+ RPKD+ANTL 
Sbjct: 241 KKHSRAATGFLKLLADKNSELFRKYALFSPSDHRVPRIYVPLKDCPQDFVARPKDYANTL 300 

Query: 299 FICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLP 358 

FICRI+DWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFS SEVLECLPQ LP 
Sbjct: 301 FICRIVDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQGLP 360 



Query: 359 WTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEG 418 



WTIPP+E KRRDLRKDCIFTI-DPSTARDLDDAL+C+ L DG F+VGVHIADVSYFVPEG 
Sbjct: 361 WTIPPEEFSKRRDLRKDCIFTIDPSTARDLDDALSCKPLADGNFKVGVHIADVSYFVPEG 420 



Query: 419 SSLDKVAAERATSVYLVQK^PMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEW 478 

S LDKVAAERATSWLVQKWPMLPRLLCEELCSLNPM+DKLTFSVIW LTPEGKIL+EW 
Sbjct: 421 SDLDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMSDKLTFSVIWTLTPEGKILDEW 480 

Query: 479 FGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQ 525 

FGRTIIRSCTKLSY+HAQSMIE+ HS EEVHQ 

Sbjct: 481 FGRTIIRSCTKLSYEHAQSMIESPTEKIPAKELPPISPEHSSEEVHQ 527 

>gi | 21292324 | gb | EAA04469 . 1 | agCP3212 [Anopheles gambiae str. PEST] 
Length = 7 94 

Score = 424 bits (1089) , Expect = e-117 

Identities = 249/672 (37%), Positives = 379/672 (56%), Gaps = 35/672 (5%) 

Query: 196 PDRGKEDSSTPVMKDENTPIPQD- - TRGLSEKSLQKSAK- WYILEKKHSRAATGXLKLL 252 

P +DS+T + N ++ T+ KS Q S W ILEK+H+R G K L 

Sbjct: 127 PECAPSDDSATSPNANGNDESAEEDATQQAGGKSTQNSVGFVVAILEKRHNRQCVG- -KFL 184 

Query; 253 ADKNTSDLFKKYALFS PSDHR VPR I YVPLKDCPQDFM TRP 2 91 

A K Y +F P D R+P + V +D P + T 

Sbjct: 185 AAAPGK- - KHYRVFMPRDMRI PPVRVFKQDWPNALLSTNIAKLKDDKEEKEDRKVPQTGD 242 

Query: 292 KDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLE 351 

D + L+ II+W+++ +G + KS+G+ G +E E E IL E+ +D + + +L 

Sbjct: 243 VD VTD VL YQAE 1 I E WQDEV- -PI GT I LKS IGKCGVLE VENES I L VEHNLD VTP YGEAI LA 300 

Query: 3 52 CLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADV 411 

LP ++P+ IP +E+ +R DLR +CIFTIDP+TARDLDDAL+C++L +G +++GVHI+DV 
Sbjct: 3 01 QLP-AVPYCIPQEELERREDLRGECIFTIDPATARDLDDALSCKQLENGNYQIGVHISDV 3 59 

Query: 412 SYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPE 471 

+YF+ E S LD++ RATS+Y+V V MLP+ LC CSL P DKL FSV W++ P+ 
Sbjct: 360 TYFLRESSPLDELVKLRATSIYMVDTVYHMLPKQLCNT-CSLLPGEDKLAFSVFWEMQPD 418 

Query: 472 GKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLH 531 

G +L F RT+I SC++LSY+HAQ M++N " ++ +++ + V L 

Sbjct: 419 GTVLSTRFARTVINSCSQLSYEHAQLMLDNPSCEVEEDQFPEIMHGYNAKQLCKIVNTLQ 478 

Query: 532 SIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAV 591 

SIA QLR++R DG L+ ++Q KL F LD TG P +Y+ R SN+++E+FMLLAN +V 
Sbjct: 479 SIAVQLRQRRMDDGCLKINQPKLTFRLDPATGRPIEYGVYKVRPSNEMIEDFMLLANTSV 538 

Query: 592 AHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKY 651 

A+ I++ FPE +LLR H PP M+ +LV G + +S + + + + 

Sbjct: 539 ANAIYKAFPEISLLRAHSPPAENMMKNLVRTLSLHGHALSYASPKDIRECMETIITTSEN 598 

Query: 652 SLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHR 711 

A + VL+ + ++PM A Y+CS E F HYAL +P+YTHFTSPIRR+AD +VHR 

Sbjct: 599 PDATRSVLSVLLAKPMIRAQYYCSLYATTPEHFMHYALAIPMYTHFTSPIRRYADCLVHR 658 

Query; 712 LLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAM 771 

+LAAAL QP P+ LQ A CN+ + + +K E S L+F ++ G E+EA 
Sbjct: 6 59 VIAAALAIDVQPKRSPEELQCLAMICNEKKYNAKCAGEASSLLYFRHWLEAVGEYETEAA 718 

Query: 772 VMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQV 831 

VMG + + + + + G+ + L+ + K +P + + P+D P + 

Sbjct: 719 VMGYAAHH I E L VL I H S G I VL KAAT KKL S T VAT WY K - P TE P VAS CML I PND T S I P P VE - - 775 

Query: 832 ITIFSLVDWLQ 843 

+TIF+ V V ++ 
Sbjct: 776 LTIFTKVRVTVK 787 * 



>gi 1 19115422 I ref [NP_594510 . 1 1 ribonuclease II RJSTB family protein; dis3-like 
[Schizosaccharomyces 
pombe] 

gi | 7493295 | pir j J T38518 ribonuclease II RNB family protein - fission yeast 

(Schizosaccharomyces pombe) 
gi|2414618|emb|CAB16367.l| ribonuclease II RNB family protein; dis3-like 
[Schizosaccharomyces 
pombe] 
Length =92 7 

Score = 418 bits (1075) , Expect = e-115 

Identities = 273/764 (35%), Positives^ = 392/764 (51%), Gaps = 91/764 (11%) 

Query: 49 I FETYMS KED VS EGLKRGTL I QGVLRI NPKKFHEAF IPS PDGDRD I F I DG WARNRALNG 108 

+ + Y V +GLK GTL +G LRI H + + D ++DG +ARNRA + 

Sbjct: 175 VYPLYYDSATVKKGLKSGTLFKGTLRILEN- -HRSAFACMEDIPDFYVDGPIARNRAFHN 232 

Query: 109 DLVWKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQ 168 

D+V+V E DPE LQ +++ 
Sbjct: 233 DWIV EPVMNNDSPTEKSNF- -LQNG VEKVK 261 

Query: 169 FDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQ 228 

DDE G ++ +++L I + K DS T 
Sbjct: 262 IKDHDDE LGGAMEHLERLEIKSVASFKGDSRT . 293 

Query: 229 KSAKWYILEKKHSRAATGILKL- -LADKNSDLFKK YALFSPSDHRVPRIYVPLKDC 283 

A+W I ++ GIL+ + KN + K YA+F P D R+P I + D 

Sbjct: 294 -RARWAIEKKAEISKIVGILRAPGWSLKOTEWSKKSSYAIFIPKDKRLPFITIHKNDL 352 

Query: 284 P QDFMTRPKD FANTLF I CR 1 1 D WKEDCNFALGQLAKS LGQAGE I E P ETEG I LTE YG 339 

+ + + + +' LF I W + +G L + LG ++E T +L E G 

Sbjct: 353 SDLSGENW I ENI LKHHDQL FS VE I TRWS I YS RYPMGVLGEKLGN I TDVEAYTNALLLENG 412 

Query: 340 VDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTD 399 

+ S FS EVL CLP W I +E+ KRRDLR + I TIDP TARDLDDA++CR L + 
Sbjct: 413 ISSSPFSDEVLNCLPPD-DWIISHEEIKKRRDLRNELIITIDPETARDLDDAVSCRALDN 471 

Query: 400 GTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDK 459 

GT+EVGVHIADV++FV S+LDK AA RAT+VYLVQK + PMLP LLCE LCSLNP + + 
Sbjct: 472 GTYEVGVHIADVTHFVKPDSALDKEAASRATTVYLVQKAI PMLPPLLCERLCSLNPNVER 531 

Query: 460 LTFSVIWKLTPEGK- ILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXH 518 

L FSV WKL GK I + WFG+T+I++C +L+Y AQ +IE H 
Sbjct: 532 LAFSVFWKLDSNGKEIGKRWFGKTVIKTCARLAYSEAQGVIEG- -KSWDDAVGKPIGGTH 589 

Query: 519 SVEE VHQAVLNLHS I AKQLRRQRFVDGALRLDQLKLAFTLDHETGL PQGCH I YE YRDSNK 578 

+ ++V ++L L I+++LR+ RF GA+ ++ +L F LD E G-f-P C +YE D+N 
Sbjct: 590 TPKDVETSILTLCEISRKLRKDRFAKGAVEINSTELBCFQLD-EYGMPNKCEVYEQTDAMH 648 

Query: 57 9 LVEEFMLLAKMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGAL 63 8 

L+EEFMLLAN +VA I + F +LLRRH P+ K +++ F M D SS+ A 
Sbjct: 64 9 LIEEFMLLANRSVAEHISKNFSNNSLLRRHASPKEKQINEFCHFLKSMNFDFDASSSAAF 708 

Query: 639 NKSLTK TFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYAL3WPLYT 695 

N S+ + TF E+ NM R + A YFC+G ++ + HYAL+ YT 

Sbjct: 709 NASMVRLRSTFNEELV ELFENMAVRSLNRAEYFCTGDFGEKTDWHHYALSFNHYT 763 

Query: 696 HFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLF 755 

HFTSPIRR+ D+IVHRLL +L + P ++ A HCN+ + + S VQE S LF 

Sbjct: 764 HFTSPIRRYPDIIVHRLLERSLK-NTSPGIDKKNCSLVAAHCNEKKEKSTTVQEDSQQLF 822 



Query: 756 FAVLVKE SGPLESEAMVMGVLNQAFDVLVLRFGVQKRI 7 93 

■+V + E + +A + + DV + +G+ R+ 

Sbjct: 823 LS VYIAEYCKKHDKKSMPVQAFATRI SGNS IDVYI SEYGI SNRV 866 



>gi 1 15220899 j ref |NP_177891 . 1 1 putative 3' -5' exoribonuclease ; protein id; 
Atlg77680 . 1 

[Arabidopsis thaliana] 
Length = 10 55 

Score = 414 bits (1063) , Expect = e-114 

Identities = 277/810 (34%), Positives - 415/810 (51%) , Gaps = 78/810.(9%) 

Query: 4 9 I FET YMS KEDVS EGLKRGTL I QGVLRINPKKFHEAF I PS PDGDRD I F IDGWARNRALNG 108 

IF + + S + V+E L++G + + R+N +EA+ DI I+G V ++RA+ G 

Sbjct: 147 IFSSHWSLDAVTEALEKGEAFKALFRVNAHNRNEAYCKIDGVPTDILINGNVCQSRAVEG 206 

Query: 109 DLVWKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQ 168 

D W+KL P W +K E+ EG P + +K + + 

Sbjct: 207 DTWIKLDPLSLWPKMK GFVTESAAKPEGTNSPPEKDDKKARQKNGIDWEG 258 

Query: 169 FDDSDSEDRHGNTSGLVDGVKK-LSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSL 227 

F+D S + + + +S + G K ++ S + P + + N G+ 

Sbjct: 259 FEDGFSKNK SSVIGKGAKNGVTPSSPPSLDSCLGSFCEQKGNCSAVDKLCGILSSFP 315 

Query: 228 QK- -SAKVVYILEKKHSR-AATGILKL LADKNSDLFK KYALFSPSDH 271 

K + +W ++EK R + G+L + + K SD + +Y P+D 

Sbjct: 316 HKRPTGQWAWEKSLVRDSIVGLLDVKGWIHYKESDPKRCKSPLSLSDDEYVQLMPADP 375 

Query: 272 RVPRIYVPLKDCPQDFMTRPK- DFANTLFICRI IDWKEDCNFALGQLAKSLGQAGEI 327 

R P++ VP P R + + L +I+DW E F + Q+ G+ E+ 

Sbjct: 376 RF PKL I VP FHVL PGS I RARLENLDPNLEAELVAAQ I VD WGEGS P F P VAQ I THLFGRGS EL 435 

Query: 328 EPETEGILTEYGVDFSDFSSEVLEGLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARD 3 87 

EP+ IL + V SDFS L LP+ +PW +P +EV +R+DLR C+ TIDPSTA D 
Sbjct: 436 EPQINAILYQNSVCDSDFSPGSLTSLPR- VPWEVPEEEVQRRKDLRDLCVLTIDPSTATD 494 

Query:- 388 LDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKVVPMLPRLLC 447 

LDDAL+ + L G F VGVHIADVSYFV + + LD A R+TSVYL+Q+ + MLP LL 
Sbjct: 495 LDDALSVQSLPGGFFRVGVHIADVSYFVLPETALDTEARFRSTSVYLMQRKISMLPPLLS 554 

Query: 448 EELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXX 507 

E + SL+P D+L FS++W L EG + + + W GRTI IRSC ' KLSYDHAQ +1+ 
Sbjct: 555 ENVGSLSPGADRLAFSILWDLNREGDVIDRWIGRTIIRSCCKLSYDHAQDIIDGKSDVAE 614 

Query: 508 XXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQG 567 

+V + +V L 1+ LR++RF +GAL+L+ K F D E G+P 
Sbjct: 615 NGWPALHGSFKWC-DVTRSVKQLSEISTTLRQKRFRNGALQLENSKPVFLFD-EHGVPYD 672 

Query: 568 CHIYEYRDSNKXiVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMG 627 

+ SN LVEEFMLLANM A I + + -f-LLRRHP P T+ L + FC + G 
Sbjct: 673 FVTCSRKGSNFLVEEFMLLANMTAAEVISQAYRASSLLRRHPEPNTRKLKEFEGFCSKHG 732 

Query: 62 8 LPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQ-EQFRH 686 

+ +D+SS+G L SL K G+ K ++L N +PMQ+A YFC+G L+D ++ H 

Sbjct: 733 MDLDI S S SGQLQDSLEKI TGNLKDDS VFVD I LNNYAI KPMQLAS YFCTGNLKDSVAEWGH 792 

Query: 687 YALNVPLYTHFTSPIRRFADVIVHR LLAAALGYSEQPDVEPD 728 

YAL VPLYTHFTSP+RR+ D++VHR L A YS+Q D 

Sbjct: 793 YALAVPLYTHFTSPLRRYPDIWHRALAAALEAEELYSKQKQTAIDEGRSCFTGIHFNKD 852 

Query: 729. TLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESG 764 

L A +CN+R++A+ + +V++ L+ ++K+ 

Sbjct: 853 AAESIEGKEALSVAALKHGVPSTEILSDVAAYCNERKLAARKVRDACDKLYTWFVLKQKE 912 

Query: 765 PLESEAMVMGVLNQAFDVLVLRFGVQKRIY 794 

EA VM + ++ V + + G+++RIY 
Sbjct: 913 I F PCEARVMNLGS RFMTVY I S KLG I ERR I Y 942 



>gi | 25406521 |pir | |E96806 hypothetical protein T32E8.1 [imported] - 
Arabidopsis thaliana 

gi | 12323300 |gb |AAG51632 . 1 |AC012193_14 putative 3*-5' exoribonuclease , 3* 
partial; 3320-1 [Arabidopsis 

thaliana] 
Length = 935 

Score = 402 bits (1033), Expect = e-110 

Identities = 274/792 (34%), Positives = 404/792 (51%), Gaps = 80/792 (10%) 

Query: 49 IFETYMSKEDVSEGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNG 108 

IF ++ S + V+E L++G + + R+N +EA+ " DI I+G V ++RA+ G 

Sbjct: 147 I FS SHWSLDAVTEALEKGEAFKALFRVNAHNRNEAYCKIDGVPTD I LINGNVCQSRAVEG 206 

Query: 109 DLVWKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQ 168 

D W+KL P W +K E+ EG P + +K + + 

Sbjct: 207 DTWIKLDPLSLWPKMK GFVTESAAKPEGTNS PPEKDDKKARQKNGIDWEG 258 

Query: 169 FDDSDSEDRHGNTSGLVDGVKK-LSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSL 227 

F+D S + + + +S + G K ++ S + P + + N G+ 

Sbjct: 259 FEDGFSKNK SSVIGKGAKNGVTPSSPPSLDSCLGSFCEQKGNCSAVDKLCGILSSFP 315 

Query: 228 QK- -SAKWYILEKKHSR-AATGILKL LADKNSDLFK KYALFSPSDH 271 

K + +W + + EK R + G+L + + K SD + +Y . P+D 

Sbjct: 316 HKRPTGQWAWEKS LVRDSIVGLLDVKGWIHYKESDPKRCKSPLSLSDDEYVQLMPADP 375 

Query: 272 RVPRIYVPLKDCPQDFMTRPK DFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEI 327 

R P+ + VP P R + + L +I+DW E F + Q+ G+ E+ 

Sbjct: 376 RFPKLIVPFHVLPGSIRARLENLDPNLEAELVAAQI VDWGEGSPFPVAQITHLFGRGSEL 435 

Query: 328 EPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARD 387 

EP+ IL + V SDFS L LP+ +PW +P +EV +R+DLR C+ TIDPSTA D 
Sbjct: 436 EPQINAILYQNSVCDSDFSPGSLTSLPR- VPWEVPEEEVQRRKDLRDLCVLTIDPSTATD 494 

Query: 388 LDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLC 447 

LDDAL+ + L G F VGVHIADVSYFV ++LD A R+TSVYL+Q+ + MLP LL 
Sbjct: 495 LDDALSVQSLPGGFFRVGVHIADVSYFVLPETALDTEARFRSTSVYLMQRKISMLPPLLS 554 

Query: 448 EELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXX 507 

E + SL+P D+L FS + +W L EG + + + W GRTIIRSC KLSYDHAQ +1 + 
Sbjct: 555 ENVGS LS PGADRLAFS I LWDLNREGDVTDRWIGRT 1 1 RS CCKLS YDHAQD 1 1 DGKSDVAE 614 

Query: 508 XXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQG 567 

>V ++V L 1+ LR++RF +GAL+L+ K F D E G+P 
Sbjct: 615 NGWPALHGSFKWC-DVTRSVKQLSEISTTLRQKRFRNGALQLENSKPVFLFD- EHGVPYD 672 

Query: 568 CHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMG 627 

+ SM LVEEFMLLANM A I + + +LLRRHP P T+ L + FC + G 
Sbjct: 673 FVTCSRKGSNFLVEEFMLLANMTAAEVISQAYRASSLLRRHPEPNTRKLKEFEGFCSKHG 732 

Query: 628 LPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQ-EQFRH 686 

+ +D+SS+G L SL K G+ K ++L N +PMQ+A YFC+G L+D ++ H 

Sbjct: 733 MDLDISSSGQLQDSLEKITGNLKDDSVFVDILNNYAIKPMQIiASYFCTGNLPCDSVAEWGH 792 

Query: 687 YALNVPL YTHFTS P IRRFADVI VHR LLAAALGYSEQPDVEPD 728 

YAL VPLYTHFTSP+RR+ D++VHR LA YS+Q D 

Sbjct: 793 YALAVPLYTHFTSPLRRYPDIWHRALAAALEAEELYSKQKQTAIDEGRSCFTGIHFNKD 852 

Query: 729 TLQKQADHCNDRRMASKRVQELS IGLF - - FAVLVKE 762 

L A +GN+R++A+ + +V++ L+ F + KE 
Sbjct: 853 AAESIEGK^ALSVAALKliGVPSTEILSDVAAYCNERKliAARKVRDACDKLYTWFVLKQKE 912 



Query: 763 SGPLESEAMVMG 774 

P E+ M +G 
Sbjct: 913 IFPCEARVMNLG 924 



>gi I 25332397 |pir I (A84553 probable mitotic control protein dis3 [imported] 
Arabidopsis 

thaliana 
Length = 955 

Score = 396 bits (1018) , Expect = e-109 

Identities = 252/757 (33%), Positives = 407/757 (53%), Gaps = 84/757 (11%) 

Query: 4 9 I FETYMSKEDVSEGLjKRGTLIQGVLRINPKKFHEAFI PS PDGDRDI FIDG WARNRALNG 10 8 

I + + +. + + + GL RG QG LR+N + EA+ + S +1 I G NRA +G 

Sbjct: 215 I YQEHKPMS E I TAGLHRG I YHQGKLRVNRFNP YEAYVGS ESI GEE 1 1 1 YGRSNMNRAFDG 274 

Query: 109 DLVWKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQ 168 

D+V V+LLP DQW+ ++K + EA I S DV+ + 

Sbjct: 275 DIVAVELLPRDQWQ DEKALSIAEEAVI SLHDWLNLS 311 

Query: 169 F DDSDSEDRHGNTSGL VDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRG 221 

F +S + ++D +T L VD + S + + + ++ PV 
Sbjct: 312 FFQISNSNADDEEDDTVHLAPDNVDDAPRTSNLSHETSGDKNAAPV 357 

Query: 222 LSEKSLQKSAKWYILEKKHSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLK 281 

+ S +W ++ + + + + G L+ +-f +ALF D R+P+I + + 

Sbjct: 358 RPSGRWGVI-RRNWHSYCGSLEPMSLPAGSGGTAHALFVSKDRRIPKIRINTR 410 

Query: 282 DCPQDFMTRPKDFANTLF I CRI I DWKEDCNFALGQLAKSLGQAGE I E PETEGI LTEYGVD 341 

+ +++ + W +G+ +G+ G+ E ETE +L E VD 

Sbjct: 411 QL QNLLDMRIWAVDSWDRQSRYPSGHYVRPIGKIGDKETETEVVLIENDVD 462 

Query: 342 FSDFSSEVLECLPQSLPWTIPPDEVGK- -RRDLRKDCIFTIDPSTARDLDDALACRRLTD 399 

+S FSS+VL CLP LPW+ + ++V R+DLR +F++DP +D+DDAL C L + 
Sbjct: 463 YSPFSSQVLACLP- PLPWSVSSEDVSNPVRQDLRHLLVFSVDPPGCKDIDDALHCTSLPN 521 

Query: 400 GTFEVGVHIADVSYFVPEGSSLDKVAAERATS VYLVQKWPMLPRLLCEELCSLNPMTDK 459 

G FE+GVHIADV+ FV G+ LD A++R TSVYLV++ + MLP+ L E++CSL + + 
Sbjct: 522 GNFELGVHIADVTNFVHPGTPLDDEASKRGTSVYLVERRIDMLPKPLTEDICSLRADVER 581 

Query: 460 LTFSVIWKXTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHS 519 

L FSVIW+++P+ +1+ F ++II+S ■ LSY AQ+ +++ 
Sbjct: 582 LAFSVIWEMSPDAEIISTRFTKSIIKSSAALSYIEAQARMDDSRL 626 

Query: 520 VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKL 579 

+ + + N+ + + +AK +R-f+R GAL L F +D E P +Y+ + 

Sbjct: 627 TDSLTTDLRNMNTLAKIMRQRRIDRGALTLASAEVKFDIDPENHDPLNIGMYQILEANQM 686 

Query: 580 VEEFMLLANKAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALN 639 

VEEFML AN-f+VA +1 + FP +LLRRHP P +ML L+ +GL +DVSS+ AL 

Sbjct: 687 VEEFMIJUlNVSVAGQILKXjFPSCSLLRRHPTPTREMLEPLLRTAAAIGLTLDVSSSKALA 746 

Query: 640 KSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTS 699 

SL + G+D Y + + + + +R M A+YFCSG L E + HY L PLYTHFTS 

Sbjct : 747 DSLDRAVGEDPYF NKL I R I LATRCMT QAVYF CSGDLSPPE - YHH YGLAAP L YT HF T S 802 

Query: 70 0 PIRRFADVIVHRLLAAALGYSEQPDVEPD- -TLQKQADHCNDRRMASKRVQELSIGLFFA 757 

PIRR+ADV VHRLLAA+LG + P V D L AD+ N R ++ S+ L+ 

Sbjct: 803 PIRRYADVFVHRLLAASLGIYKLPTVFQDRPQLTSVADNLNYRHRNAQMAGRASVELYVL 862 

Query: 758 VLVKE SGPLES EAMVMGVLNQAFD VLVLRFGVQKR I Y 7 94 

+ ++P+EAV+++ FVV + +G+ + +Y 
Sbjct: 863 IYFR-TRPTDEEARWKIRSNGFIVFVPKYGIEGPVY 898 

>gi | 24654592 | ref | NP_72 84 90 . 1 | CG1694 0-PC [Drosophila melanogaster] 
gi j 23 092667 |gb|AAN1142 5 . 1 1 AE003467_3 7 CG1694 0-PC [Drosophila melanogaster] 
Length = 1044 



Score = 367 bits (942) , Expect = e-100 

Identities - 249/739 (33%), Positives = 372/739 (50%), Gaps = 82/739 (11%) 



Query: 66 GTL I QGVLR INPKKFHEAF I PS PDGDRDI F I DGV VARNRALNGDL WVKLL - PEDQ 120 

G +++ +R+N K +AFI + DG+ VAR A +GD V +L P Q 

Sbjct: 308 GRIVEEEIRWRKNNRQAFIIMSTDREALERDGIVLLPVARRYAFDGDKVRAFVLNPGAQ 367 

Query: 121 WKAVKPESNDKEIEATYEADI PEEGCGHHPLQQSRKGWSGPD VI IEAQFDDSDSEDRHGN 180 

+ E+EI GP G E DD++S+ + 

Sbjct: 368 GSSKTAEPSSGEIS- GGKPSLSLADG EELSDDTESQGSESD 407 

Query: 181 TSGLVDGVKKXSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKVVYILEKK 240 

T +V V+ +N P A V+ I ++ 

Sbjct: 4 08 TDNW VISSDNCP KAFVIAITKRT 431 

Query: 241 HSRAATGIL KLLADKNSDLFKKYALFSPSDHRVPRIWPLKDCPQDFMTRPK-D 293 

R G + KLD+ LF K FPD RVP +YVP C + + D 

Sbjct: 432 ELRQIVGTISFTNPTKLCDDQ LFYK FRP YDLRVPMVYVPKDACAAH I GNKQQ I D 485 

Query: 294 FANTLFICRIIDWKEDCN-FALGQLAKSLGQAGEIEPETEGILTEYGV-DFSDFSSEVLE 351 

+ L++ I++ DCN + +L + +G+ G ++ E + IL G+ D F ++ 
Sbjct: 486 VSGLLYLAHILE- -TDCNGHCIAELIQPVGRVGNLDDELKAILFHNGLRDIKPFEQRFID 543 

Query: 352 CLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADV 411 

Q P I +++ +R+DLRK CIFTIDP TARDLDDA+ + +L D +E+GVHI+DV 
Sbjct: 544 I YSQPPP -PI SQEDLRQRKDLRKMC I FT I DPMTARDLDDAVS I EKLGDNE YE IGVHI SDV 602 

Query: 412 SYFVPEGSSLDK^AAERATSVTLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPE 471 

S + F+ E + LD + ER+TS+YL +V+ MLP+ LC CSL P DK FSV W++ + 
Sbjct: 603 SHFLIEDNELDNIVKERSTSIYIiANEVIHMLPQSLCMR-CSLLPGQDKFAFSVFWRMNGK 661 

Query: 472 GKILEE- -WFGRTI IRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLN 529 

G +L+ + F RT+I SC++ +Y+HAQ +I+N + +++ VL 

Sbjct: 662 GVMLQKKPEFCRTVINSCSQFAYEHAQKIIDNPNERFTENDFPTILNGFNPDDIRNRVLW 721 

Query: 53 0 LHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANM 589 

LH IA +R+ R +GAL ++ KL F LD TG P + + R++N+L+EEFMLLAN 
Sbjct: 722 LHDIASSIRKTRLDNGALTINNAKLRFLLDPITGEPLSFEVEKQREANRLIEEFMLLANQ 781 

Query: 590 AVAHKI FRTFPEQALLRRHP PPQTKMLSDL VEFCDQMGL PMD VS SAGALNKS LTKTFGDD 64 9 

AVA I +FP+ A+LR HPPP K L L E +G +D SS+ AL +S+ + + 
Sbjct: 782 AVARFIHDSFPDIAVLRNHPPPLIKSLKALREKLLALGFELDYSSSKALQESMVRLCNEA 841 

Query: 650 KYS LARKE VLTNMYSRPMQMAL YFCS GMLQDQEQFRHYALNVPLYTHFTS P IRRFADVI V 709 

+A L+ + +PM A YFCS + HYAL+ + P+ YTHFTS PIRR+ D++V 

Sbjct: 842 PNPVAMNACLSQLLMKPMARATYFCSEGKSEPADLWHYALS I P I YTHFTS P IRRYPD I LV 901 

Query: 710 HRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESE 769 

HRLLAAAL Y P PD L N+R+ +K+ E S L+F V + 

Sbjct: 902 HRLLAAALKYCTPPK^TPDDLHTLTKLANERKYNAKKAGEDSGNLYFK^YVHNKQGIYMR 961 

Query: 770 AMVMGVLNQAFDVLVLRFG 788 

A+V+ + +V+ L G 

Sbjct: 962 AWIEIFQHMMNWTLESG 980 



>gi 


: 19922976 


gi 


'24654597 


gi 


16198179 


gi 


23092668 


gi 


23092669 



ref |NP_612 012 . 1 
ref |NP_728491.1 
gb |AAL13898 . 1 
gb|AAF4 7351.2 
gb|AAN11426.1 
Length - 1032 



CG16940-PA [Drosophila melanogaster] 
CG16940-PB [Drosophila melanogaster] 
LD37985p [Drosophila melanogaster] 
CG1694 0-PA [Drosophila melanogaster] 
AE003467 39 CG16940-PB [Drosophila melanogaster] 



Score = 367 bits (942), Expect = e-100 * 

Identities = 249/739 (33%), Positives = 372/739 (50%), Gaps = 82/739 (11%) 



r 



Query: 66 GTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGV VARNRALNGDLWVKLL - PEDQ 12 0 

G +++ +R+N K +AFI + DG+ VAR A +GD V +L P Q 

Sbjct: 296 GRIVEEEIRVNRKNNRQAFI IMSTDREALERDGIVLLPVARRYAFDGDKVRAFVLNPGAQ 355 

Query: ,121 WKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQFDDSDSEDRHGN 180 

+ E+EI GP G E DD++S+ + 

Sbjct: 356 GSSKTAEPSSGEIS GGKPSLSLADG EELSDDTESQGSESD 395 

Query: 181 TSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKWYILEKK 240 

T +V V+ +N P A V+ I ++ 

Sbjct: 396 TDNW : VISSDNCP KAFVIAITKRT 419 

Query: 241 HSRAATGIL KLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPK-D 293 

R G + KL D + LF K F P D RVP +YVP C + + D 

Sbjct: 420 ELRQIVGTISFTNPTKLCDDQ LFYK FRPYDLRVPMVYVPKDACAAHIGNKQQID 473 

Query: 294 FANTLFICRIIDWKEDCN-FALGQLAKSLGQAGEIEPETEGILTEYGV-DFSDFSSEVLE 351 

+ L+ + I++ DCN + +L + +G+ G ++ E + IL G+ D F ++ 
Sbjct: 474 VSGLLYLAHILE - -TDCNGHCIAELIQPVGRVGNLDDELKAILFHNGLRDIKPFEQRFID 531 

Query: 352 CLPQSLPWTI PPDEVGKRRDLRKDCI FTIDPSTARDLDDALACRRLTDGTFEVGVHIADV 411 

Q P I +++ +R+DLRK ClFTIDP TARDLDDA++ +L D +E+GVHI+DV 
Sbjct: 532 IYSQPPP-PISQEDLRQRKDLRKMCIFTIDPMTARDLDDAVSIEKLGDNEYEIGVHISDV 590 

Query: 412 SYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPE 471 

S+F+ E + LD + ER+TS+YL +V+ MLP+ LC CSL P DK FSV W++ + 
Sbjct: 591 SHFLIEDNELDNIVKERSTSIYLANEVIHMLPQSLCMR-CSLLPGQDKFAFSVFWRMNGK 649 

Query: 472 GKILEE- -WFGRTI IRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLN 529 

G + L++ F RT+I SC++ +Y+HAQ +I+N + +++ VL 

Sbjct: 650 GVMLQKKPEFCRTVINSCSQFAYEHAQKIIDNPNERFTENDFPTILNGFNPDDIRNRVLW 709 

Query.: 53 0 LHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANM 58 9 

LH IA +R+ R +GAL ++ KL F LD TG P + + R++N+L+EEFMLLAN 
Sbjct: 710 LHDIASSIRKTRLDNGALTINNAKLRFLLDPITGEPLSFEVEKQREANRLIEEFMLLANQ 769 

Query: 590 AVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVS SAGALNKSLTKTFGDD 64 9 

AVA I +FP+ A+LR HPPP K L L E +G +D SS+ AL +S+ + + 

Sbjct: 770 AVARFIHDSFPDIAVLRNHPPPLIKSLKALREKLLALGFELDYSSSKALQESMVRLCNEA 829 

Query: 650 KYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIV 70 9 

+A L+ + +PM A YFCS + HYAL++P+YTHFTSPIRR+ D++V 

Sbjct: 830 PNPVAMNACLSQLLMKPMARATYFCSEGKSEPADLWHYALSIPIYTHFTSPIRRYPDILV 889 

Query: 710 HRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESE 769 

HRLLAAAL Y P PD L N+R+ +K+ E S L+F V + 

Sbjct: 890 HRLLAAALKYCTPPKRTPDDLHTLTKLANERKYNAKKAGEDSGNLYFKRYVHNKQGIYMR 949 

Query: 770 AMVMGVLNQAFDVLVLRFG 7 88 

A+V+ + +V+ L G 

Sbjct: 950 AW I E I FQHMMNWTLE SG 968 

>gi|l529261lJgb|AAK93574.l| SD10981p [Drosophila melanogaster] 
Length = 982 

Score = 351 bits (901) , Expect = 2e-95 

Identities = 249/815 (30%), Positives = 407/815 (49%), Gaps - 110/815 (13%) 

Query : 50 FET YMS KEDVS EGLKRGTL I QGVLR I NPKKFHEAF IPS PDGDRD I F I DG WARNRALNGD 109 

+ ++S +++ EGL++ L+QG ++++E + ++IIG + NRA++GD 

Sbjct: 232 YPPHLSMKELLEGLRQNKLLQGTFQASRENYLEGTVNVEKFEKGILIQGRESLNRAVDGD 291 



Query: 110 LVWKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQF 169 

LV- V+LLPE +W A P E + Y + + P 
Sbjct: 292 LVAVELLPEAEWSA--PSEIVLEEKNVYADEVP 322 



Query: 170 DDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQK 229 

SE+R + + +++ V+ ++S E TP 
Sbjct: 323 SEERAKDENEMLNQVRAAALSA ERTP 348 

Query: 230 SAKWYILEKKHSRAATGILK- -LLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDF 287 

+ ++V 1+ +K R GIL+ L+ D N +F P+D ++PRI + + 

Sbjct: 34 9 TGRIVGIVRRKW-RQYCGILQPSLIEDTNRH IFVPADRKIPRIRIETRQAAM- - 3 99 

Query: 288 MTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSS 347 

N I I W + + G +SLG G++ E E IL E+ V FS 
Sbjct: 400 LQNQRI IVTIDTWPRNSRYPHGHFVRSLGPLGDMATENEVILLEHDVPHCKFSD 453 

Query: 348 EVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDAIiACRRLTDGTFEVGVH 407 

EVL LP+ + PWTI + + KR DLR I ++DP D+DDAL CR L +G EVGVH 
Sbjct: 454 EVLSFLPK-MPWTITDEDYSKRVDLRDLYICSVDPPGCTDIDDALHCRELPNGNLEVGVH 512 

Query: 408 IADVSYFVPEGSSLDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWK 4 67 

IADVS+F+ G++LD AA R T+VYLV K + M+P LL LCSL ++ FS +W+ 
Sbjct: 513 IADVSHFIRPGNALDMEAAARGTTVYLVGKRIDMVPELLSSNLCSLVGGVERFAFSCVWE 572 

Query: 468 LTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAV 527 

+ E +L + F +++I+S ++Y+ AQ++I++ E+ +++ 

Sbjct: 573 VDNEANVLSKRFHKSVIKSKRAMTYEEAQNIIDDATQQ NEIAKSL 617 

Query: 528 LNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLIiA 587 

NL+ +AK L+++R +GAL L ++ F +D ET P +"+ R++N +VEEFMLLA 
Sbjct: 618 RNLNRLAKILKKRRMDNGALVIASPEIRFQVDSETHEPLEVEVKQMRETNSMVEEFMLLA 677 

Query: 588 NMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTF- 646 

N+ VA I F E A+LRRHP P LV+ G +D++S L+ SL K 

Sbjct: 678 NITVAEHIATEFSECAVLRRHPRPPPTNFDPLVKSARYQGFQVDINSGLELSHSLDKCVK 737 

Query: 64 7 GDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFAD 706 

D+ Y +LT +R M A+YF SG LQ +E+F HY L P+YTHFTSPIRR++D 
Sbjct: 738 ADNPYFNTMIRILT TRCMMQAVYFISGSLQ-KEEFFHYGLAAPIYTHFTSPIRRYSD 793 

Query: 707 VIVHRLLAAALG YSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKE 762 

++VHRLLAA++G Y++ D++++ +NR ++" S+L + + 
Sbjct: 794 IMVHRLLAASIGADSTYAQLLDRKSN- -EELCHNLNYRHKMAQYAGRASVALNTHLFFRG 851 

Query : 763 SGPLESEAJVTmGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQCTGKKPELTLVWEPDD. 822 

+ E V+ V A VL+ ++G++ +Y + ++V K E+ + +D 

Sbjct: 852 KEE-DEEGYVLFVRKNALQVLIPKYGLEGTLYLKSDKDGKDGVERV--KSEIVFTFNEED 908 

Query: 823 LEEEPTQQVITIFSLVDWLQAEATALKYSAILKR 857 

+ V F V V L +++ + + + ++ R 

Sbjct: 909 HTQRCGDWFHSFDPVTVRLSLDSSNVQHEKLIFR 943 



18488261 1 ref |XP_081229 . 1 1 Dis3 [Drosophila melanogaster] 
24649634 | ref [ NP_65 124 6 . 2 j CG6413-PA [Drosophila melanogaster] 
7301148]gb|AAF56281.l| CG6413-PA [Drosophila melanogaster] 
Length = 982 



Score = 350 bits (898)', Expect = 6e-95 

Identities = 248/815 (30%), Positives = 407/815 (49%), Gaps = 110/815 (13%) 

Query: 50 FETYMS KED VS EGLKRGTL I QGVLRI NPKKFHEAF IPS PDGDRD I F I DGWARNRALNGD 109 

+ + + S +++ EGL++ L+QG ++++E + ++ I I G + NRA++GD 

Sbjct: 232. YP PHLSMKELLEGLRQNKLLQGT FQAS RENYLEGTVNVEKFEKG I L I QGRES LNRAVDGD 291 



Query: 110 LVWKLLPEDQWKAVKPESlsrDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQF 169 

LV V+LLPE +W A P E + Y ++P 
Sbjct: 292 LVAVELLPEAEWSA- -PSEIVLEEKNVYADEVP 322 



Query: 170 DDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQK 229 

SE+R + + +++ V+ ++S E TP 
Sbjct: 323 SEERAKDENEMLNQVRAAALSA ERTP 348 

Query: 23 0 SAKWYILEKKHSRAATGILK- -LLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDF 2.87 

+ ++V 1+ +K R GIL+ L+ D N . +F P+D + + PRI + + 
Sbjct: 349 TGRI VGI VRRKW - RQ YCGI LQPSL I EDTNRH IFVPADRKIPRIRIETRQAAM- - 399 

Query: 288 MTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSS 347 

N I I W + + G +SLG G+ + E E IL E+ V FS 
Sbjct: 400 LQNQRIIVTIDTWPRNSRYPHGHFVRSLGPLGDMATENEVILLEHDVPHCKFSD 453 

Query: 34 8 EVLECLPQSLP WT I PPDE VGKRRDLRKDC I FT I DPS TARDLDDALACRRLTDGTFE VGVH 4 07 

EVL LP+ +PWTI ++ KR DLR I ++DP D+DDAL C+ L +G EVGVH 
Sbjct: 454 EVLS FLPK- MPWT I TDEDYS KRVDLRDLYI CS VDPPGCTD I DDALHCKELPNGNLEVGVH 512 

Query: 408 IADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRIiLCEELCSLNPMTDKLTFSVIWK 467 

IADVS+F+ G++LD AA R T+VYLV K + M+P LL LCSL ++ FS +W+ 
Sbjct: 513 IADVSHFIRPGNALDMEAAARGTTVYLVGKRIDMVPELLSSNLCSLVGGVERFAFSCVWE 572 

Query: 468 LTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAV 527 

+ E +L + F +++I+S ++Y+ AQ++I++ E+ +++ 

Sbjct: 573 VDNEANVL S KRFHKS VI KS KRAMTYEEAQN 1 1 DDATQQ NEIAKSL 617 

Query: 528 LNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLA 587 

NL+ +AK L+++R +GAL L ++ F +D ET P + + R++N +VEEFMLLA 
Sbjct: 618 RNLNRIAKILKKRRMDNGALVLASPEIRFQVDSETHEPLEVEVKQMRETNSMVEEFMLLA 677 

Query: 588 NMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTF- 646 

N+ VA I F E A+LRRHP P LV+ G . +D+ + S L+ SL K 

Sbjct: 678 NITVAEHIATEFSECAVLRRHPRPPPTNFDPLVKSARYQGFQVDINSGLELSHSLDKCVK 73 7 

Query: 647 GDDKYSIARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFAD 706 

D+ Y +LT +R M A+YF SG LQ +E+F HY L P+YTHFTSPIRR++D 
Sbjct: 738 ADNPYFNTMIRILT TRCMMQAVYF I SGS LQ - KEEFFH YGLAAP I YTHFTS P I RR YSD 793 

Query: 707 VIVHRLLAAALG YSEQPDVEPDTLQKQADHCNDRRMAS KRVQELSIGLFFAVLVKE 762 

++VHRLLAA++G Y++ D++++ +NR ++ S+L ++ 

Sbjct: 794 IMVHRLLAAS IGADSTYAQLLDRKSN- - EELCHNLNYRHKMAQYAGRASVALNTHLFFRG 851 

Query: 763 SGPLESEAMVMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDD 822 

+ E V+ V A VL+ ++G++ +Y + ++V K E+ + +D 

Sbjct: 852 KEE - DEEGYVLFVRKNALQVL I PKYGLEGTLYLKSDKDGKDGVERV- - KSE I VFTFNEED 908 

Query: 823 LEEEPTQQVITIFSLVDWLQAEATALKYSAILKR 857 

+ V F V V L + + + +++ ++ R 

Sbjct: 909 HTQRCGDWFHSFDPVTVRLSLDSSNVQHEKLIFR 943 

>gi | 21748526 | dbj |BAC03400.l| FLJ00327 protein [Homo sapiens] 
Length =2 66 

Score = 343 bits (879), Expect = 9e-93 

Identities = 167/188 (88%), Positives =* 175/188 (93%) 

Query: 578 KLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGA 63 7 

+LVEEFMLLANMAVAHRI R FPEQALLRRHPPPQT+MLSDLVEFCDQMGLP+D SSAGA 
Sbjct: 29 RL VE E FMLLANMAVAHKI HRAFPEQALLRRHPP PQTRML SDLVE FCDQMGL PVD FS S AGA 88 

Query: 63 8 LNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYAL3STVPLYTHF 697 

LNKS LT+TFGDDKYS LARKEVLTNM SRPMQMALYFCSG+LQD QFRHYALNVPLYTHF 
Sbjct : 8 9 LNKS L TQT FGDDKYS LARKEVLTNMC S R PMQMAL YF C S GLLQD P AQ FRHYALNVP L YTH F 14 8 

Query: 698 TSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQADHCNDRRMASKRVQELSIGLFFA 757 

TS PI RRFADV+ VHRLLAAALGY E+ D+ PDTLQKQADHCNDRRMASKRVQELS LFFA 
Sbjct: 149 TSPIRRFADVLVHRLLAAALGYRERLDMAPDTLQKQADHCNDRRMASKRVQELSTSLFFA 208 



t 



Query: 758 VLVKESGP 765 

VLVK S P 
Sbjct: 209 VLVKVSPP 216 

>gi | 26327903 J dbj | BAC27692 . 1 1 unnamed protein product [Mus musculus] 
Length =687 

Score = 340 bits (871) , Expect = 7e-92 

Identities = 233/670 (34%) , Positives = 344/670 (51%), Gaps = 60/670 (8%) 

Query: 192 SISTPDRGK^DSSTPVMKDENTPIPQDTRGLSEKSLQKSAKVVYILEKXHSRAATGILKL 251 

S+ D G+ + V KDE + T + SEK L+ + +W 1+ K++ R G+L 
Sbjct : 36 SWLDDEGQNEDD--VEKDEERELLLKT-AVSEKMLRPTGRWGII-KRNWRPYCGMLSK 91 

Query: 252 LADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCN 311 

K S + LF+P+D R+PRI + + + I I W + 

Sbjct: 92 SDIKES RRHLFTPADKRIPRIRIETR QASALEGRRI IVAIDGWPRNSR 139 

Query: 312 FALGQ LAKS LGQAGE IEPETEGI LTE YGVD F S D F S S E VL E CL PQ S L P WT I P PD E VGKRRD 371 

+ G K+LG GE E ETE +h E+ V FS VL LP+ + PW+I +++ R D 
Sbjct: 140 YPNGHFVKNLGDVGEKETETEVLLLEHDVPHQPFSQAVLSFLPR-MPWSITEEDMKNRED 198 

Query: 3 72 LRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATS 431 

LR C+ ++DP D+DDAL CR L++G EVGVHIADVS + F+ G++LD+ +A R T+ 
Sbjct: 199 LRHLCVCSVDPPGCTDIDDALHCRELSNGNLEVGVHIADVSHFIRPGNALDQESARRGTT 258 

Query: 432 VYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVrWKLTPEGKILEEWFGRTIIRSCTKLS 491 

VYL +K + M+P LL LCSL D+L FS IW++ +IL+ F +++I S L+ 

Sbjct: 259 VYLCEKRIDMVPELLSSNLCSLRSNVDRLAFSCIWEMNHNAEILKTRFTKSVINSKASLT 318 

Query: 492 YDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQ 551 

Y AQ I + + + + + ++ L+ +AK L+ + R GAL L 

Sbjct: 319 YAEAQMRIDSAAMN DDITTSLRGLNQLAKILKKGRIEKGALTLSS 363 

Query: 552 LKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPP 611 

+ +• F +D ET P E R++N +VEEFMLLAN++VA KI F E ALLR+HP P 

Sbjct: 364 PEIRFHMDSETHDPIDLQTKELRETNSMVEEFMLLANISVAKKIHEEFSEHALLRKHPAP 423 

Query: 612 QTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMAL 671 

LV+ L + +A +L SL + D L +L + +R M A+ 

Sbjct: 424 P P SNYD I L VKAAKS KNLQ I KTDTAKS LADS LDRAE S PDF P YL - - NTLLR I LATRCMMQAV 481 

Query: 672 YFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALG YSEQPDVEP 72 7 

YFCSGM D F HY L P+YTHFTSPIRR+AD+IVHRLLA A+G Y E D 
Sbjct: 482 YFCSGMDND FHHYGLAS PI YTHFTS PIRRYADI I VHRLLAVAIGADCTYPELTDK- - 536 

Query: 728 DTLQKQADHC NDRRMAS KRVQEL S IGL FFAVL VKESGPLES EAMVMGVLNQAFDVLV 784 

K+DC NR ++QS+ +KG+EA++V A VL+ 

Sbjct: 537 HKLSDICKNLNFRHKMAQYAQRASVAFHTQLFFKSKGIVSEEAYILFVRKNAIWLI 593 

Query: 785 LRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVITIFSLVDWLQA 844 

++G++ ++ +K KP L E LETV+FVV + 

Sbjct: 594 PKYGLEGTVFFE EKDKPKPRLAYDDEIPSLRIEGT- -VFHVFDKVKVKITL 642 

Query: 84 5 EATALKYSAI 8 54 

+++ L++ I 
Sbjct: 643 DSSNLQHQKI 652 



Score - 37.7 bits (86), Expect = 0.87 

Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 2/54 (3%) 

Query: 91 DRD I FIDGVVARNRALNGDLVVVKLLPEDQWKAVKPE SNDKE I EAT YEAD I PEE 144 

+++I I G+ NRA++ D+V V+LLP QW V P S + E E D+ ++ 



Sbjct: 1 EKEILIQGIKHLNRAVHEDIVAVELLPRSQW- -VAPSSVVLDDEGQNEDDVEKD 52 



, >gi | 13446610 j emb| CAC35051 . 1 j putative exoribonuclease DIS3 [Drosophila 
melanogaster] 

Length =983 

Score = 335 bits (859) , Expect = 2e-90 

Identities = 244/816 (29%), Positives = 401/816 (49%), Gaps = 111/816 (13%) 

Query: 50 FET YMSKEDVSEGLKRGTL I QGVLR I NPKKFHEAF I PS PDGDRD I FIDGWARNRALNGD 10 9 

+ ++S +++ EGL+ + L+QG + + + + E + ++ I I G + NRA++GD 

Sbjct: 232 YPPHLSMKELLEGLRQNKLLQGTFQASRENYLEGTVNVEKFEKGILIQGRESLNRAVDGD 291 

Query: 110 LWVKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVI IEAQF 169 

LV V+LLPE +W A P E + Y ++P 
Sbjct: 292 LVAVELLPEAEWSA- - PSE IVLEEKNVYADE VP 322 

Query: 170 DDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQK 229 

SE+R + + +++ V+ ++S E TP 
Sbjct: 323 SEERAKDENEMLNQVRAAALSA ERTP 348 

Query: 230 SAKWYILEKKHSRAATGILK- -LLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDF 287 

+ ++V 1+ +K R GIL+ L+ D N +F P+D + + PRI + + 

Sbjct: 349 TGR I VG I VRRKW - RQ YCG I LQ P S L I EDTNRH IFVPADRKIPRIRIETRQAAM- - 399 

Query: 288 MTRPKDFANTLF I CRI IDWKEDCNFALGQLAKSLGQAGE I EPETEGI LTEYGVDFSDFS S 347 

N I I W ' + + G +SLG G++ E E IL E-f V FS 
Sbjct: 400 LQNQRIIVTIDTWPRNSRYPHGHFVRSLGPLGDMATENEVILLEHDVPHCKFSD 453 

Query: 348 EVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDAIACRRLTDGTFEVGVH 407 

EVL LP+ +PWTI ++ KR DLR I ++DP D+DDAL CR L +G EVGVH 
Sbjct: 454 EVLSFLPK-MPWTITDEDYSKRVDLRDLYICSVDPPGCTDIDDALHCRELPNGNLEVGVH 512 

Query: 408 IADVSYFVPEGSSLDKVAAERA-TSWLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIW 466 

IADVS + F+ G + T+VYLV K + M+P LL LCSL ++ FS +W 

Sbjct: 513 I AD VSHF I RPGKRTGHGGSGPGETTVYL VGKRI DMVP ELLS SNLC S LVGGVERFAF S C VW 572 

Query: 4 67 KLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQA 526 

++ E +L + F +++I+S ++Y+ AQ++I++ E+ ++ 

Sbjct: 573 EVDNEANVLS KRFHKS VI KS KRAMT YEEAQN 1 1 DDATQQ NEIAKS 617 

Query: 527 VLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLL 586 

+ NL+ +AK L+ ++R +GAL X ++ F +D ET P + + R++N +VEEFMLL 
Sbjct: 618 LRNLNRLAKILKJCRRMDNGALVLASPEIRFQVDSETHEPLEVEVKQMRETNSMVEEFMLL 677 

Query: 587 ANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTF 646 

AN+ VA I F E A+LRRHP P LV+ G +D+ + S L+ SL K 

Sbjct: 678 ANITVAEHIATEFSECAVLRRHPRPPPTNFDPLVKSARYQGFQVDINSGLELSHSLDKCV 737 

Query: 647 -GDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFA 705 

D+ Y +LT +R M A+YF SG LQ +E+F HY L P+ YTHFTSPIRR++ 
Sbjct: 738 KADNPYFNTMIRILT TRCMMQAVYF I SGSLQ - KEEFFHYGLAAPI YTHFTS PIRRYS 793 

Query: 706 DVIVHRLLAAALG YS EQPDVE PDTLQKQADHCNDRRMAS KR VQELS I GLFFAVL VK 761 

D++VHRLLAA++G Y++ D + +++ +NR ++ S+L + + 

Sbjct: 794 DIMVHRLLAASIGADSTYAQLLDRKSN- - EELCHNLNYRHKMAQYAGRASVALNTHLFFR 851 

Query:, 762 ESGPLESEAMVTVIGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPD 821 

+ E V+ V A VL+ + +G+ + +Y + ++V K E+ + + 

Sbjct: 852 GKEE-DEEGYVLFVRKNALQVLIPKYGLEGTLYLKSDKDGKDGVERV- -KSEIVFTFNEE 908 

Query: 822 DLEEEPTQQVITIFSLVDWLQAEATALKYSAILKR 85 7 

D + V F V V- L + + + + + + ++ R 

Sbjct: 909 DHTQRCGD WFHS FD PVTVRLS LDS SNVQHEKL I FR 944 



>gi | 5262619 | emb | CAB45749 . 1 | hypothetical protein [Homo sapiens] 
Length =63 2 

Score = 334 bits (856) , Expect = 4e-90 

Identities = 226/644 (35%) , Positives = 335/644 (52%) , Gaps = 65/644 (10%) 

Query: 222 LSEKSLQKSAKVVYILEKXHSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLK 281 

+SEK L+ + +W 1+ K+ + R G+L . K S + LF+P+D R+PRI + + 
Sbjct: 8 VSEKMLKPTGRWGI I - KRNWRPYCGMLSKS DIKES RRHLFTPADKRIPRIRIETR 62 

Query: 282 DCPQDFMTRPKDFANTLFICRII DWKED CNFALGQLAKS LGQAGE I E PETEGI LTE 337 

A+TL RII W + + G + +LG GE E ETE + L E 

Sbjct: 63 Q ASTLEGRRIIVAIDGWPRNSRYPNGHFVRNLGDVGEKETETEVLLLE 110 

Query: 338 YGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRL 3 97 

+ V FS VL LP+ + PW+I + + R DLR CI ++DP D+DDAL CR L 
Sbjct: 111 HDVPHQPFSQAVLSFLPK-MPWSITEKDMKNREDLRHLCICSVDPPGCTDIDDALHCREL 169 

Query: 398 TDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMT 457 

+G EVGVHIADVS+F+ G++LD+ +A R T+VYL +K + M+P LL LCSL 
Sbjct: 170 ENGNLEVGVHIADVSHFIRPGNALDQESARRGTTVYLCEKRIDMVPELLSSNLCSLKCDV 229 

Query: 458 DKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXX 517 

D+L FS IW+ + +IL+ F + + + I S L+Y ^AQ I + + 
Sbjct: 230 DRLAFSCIWEMNHNAEILKTKFTKS VINSKASLTYAEAQLRIDSANMN 277 

Query: 518 US VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHI YE YRDSN 577 

+ + + ++ L+ +AK L+ ++R GAL L ++ F +D ET P E R++N 

Sbjct: 278 DDI TTS LRGLNKLAKI LKKRRI EKGALTLS S PEVRFHMDSETHDP IDLQTKELRETN 334 

Query: 578 KLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGA 63 7 

+VEEFMLLAN++VA KI F E ALLR+HP P LV+ L + +A + 

Sbjct: 335 SMVEEFMLLANI S VAKKIHEEFS EHALLRKHPAP PPSNYE I LVKAARS RNLE I KTDTAKS 394 

Query: 638 LNKS LTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHF 697 

L + SL + L +L + +R M A+YFCSGM D F HY L P+YTHF 
Sbjct: 395 LAES LDQAES PTFP YL - - NTLLRI LATRCMMQAVYFCSGMDND FHHYGLAS P I YTHF 449 

Query: 698 TS P I RRFADVI VHRLLAAALG YSEQPDVEPDTLQKQADHC NDRRMASKRVQEL 750 

TSPIRR+ADVIVHRLLA A+G YE D K AD C NR ++ Q 

Sbjct: 450 T S P I RRYAD V I VHRLLAVA I GADCTY P E LTDK HKLAD I CKNLNFRHKMAQYAQRA 504 

Query: 751 SIGLFFAVLVKESGPLESEAMVMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQKVGK 810 

S+ +KG+EA++V A VL+ ++G+ + + F + 

Sbjct: 505 SVAFHTQLFFKSKGIVSEEAYILFVRKNAIWLIPKYGLEGTV FFEEKD 553 

Query: 811 KPELTLVWEPDDLEEEPTQQVITIFSLVDWLQAEATALKYSAI 854 

KP L+ + + + + V +F V V + + + + L+ + I 

Sbjct: 554 KPNPQLI YDDE I PSLKI EDTVFHVFDKVKVKIMLDS SNLQHQKI 597 

>gi J 27703387 | ref | XP_224449 . 1 | similar to mitotic control protein dis3 
homo log [Homo sapiens] 

[Rattus norvegicus] 
Length = 1034 

Score = 330 bits (846) , Expect = 7e-89 

Identities - 233/679 (34%), Positives = 343/679 (50%), Gaps = 69/679 (10%) 

Query: 192 SISTPDRGKEDSSTPVTVIKDENTPIPQDTRGLSEKSLQKSAKVVYILEKKHSRAATGILKL 251 

S+ D G+ + V KDE + T + SEK L+ + +W 1+ K+ + R . G+L 
Sbjct: 374 SWLHDEGQNEDD--VEKDEERELLLKT-AVSEKMLRPTGRWGII-KRNWRPYCGMLSK 429 

Query: 252 LADKNSDLFKKYALFSPSDHRVPRIYVPLKDGPQDFMTRPKDFANTLFICRIIDWKEDCN 311 

K S + LF+P+D R+PRI + + + I I W + 

Sbjct: 430 SDIKES RRHLFTPADKRIPRIRIETR QASALEGRRI I VAIDGWPRNSR 477 



Query: 312 FALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRD 371 

+ G K+LG GE E ETE +L E+ V FS VL LP+ +PW+I +++ R D 
Sbjct: 478 YPNGHFVKNLGDVGEKETETEVLLLEHDVPHQPFSQAVLSFLPK-MPWSITEEDMKNRED 536 

Query: 3 72 LRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGTOIADVSyFVPEGSSLDCTAAERATS 431 

LR C+ ++DP D+DDAL CR L++G EVGVHIADVS + F+ G++LD+ +A R T+ 
Sbjct: 53 7 LRHLCVCSVDPPGCTDIDDALHCRELSNGNLEVGVHIADVSHFIRPGNALDQESARRGTT 596 

Query: 432 VYLVQKVVPMLPRLLCEELCSLNPMTDKL TFSVIWKLTPEGKILEEWFGRT 482 

VYL +K + M+P LL LCSL D+ F IW++ +IL+ F ++ 

Sbjct: 597 VYLCEKRIDMVPELLSSNLCSLRSNVDRYFCIVFDKXWHFPXIWEMNHNAEILKTRFTKS 656 

Query: 483 I IRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHS IAKQLRRQRF 542 

+ 1 S L+Y AQ I + + + + + ++ L+ +AK L+R R 

Sbjct:, 657 VI N S KAS L T YAE AQMR I D S AAMN DD I TTS LRGLNKLAKI LKRGRI 701 

Query: 543 VDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQ 602 

GAL L ++ F +D ET P E R++N +VEEFMLLAN++ VA KI F E 

Sbjct: 702 EKGALTLSSPEIRFHMDSETHDPIDLQTKELRETNSMVEEFMLLANISVAKKIHEEFSEH 761 

Query: 603 ALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNM 662 

ALLR+HP P LV+ L + +A +L SL + D L +L + 

Sbjct: 762 ALLRKHPAPPPSNYEVLVKAAKSKNLEIRTDTAKSLADSLDRAECPDFPYL- -NTLLRIL 819 

Query: 663 YS RPMQMAL YFCSGMLQDQEQFRHYALNVPL YTHFTS P I RRFADVI VHRLLAAALG 718 

+R M A+YFCSGM D F HY L P+ YTHFTS P IRR+AD+ IVHRLLA A+G 
Sbjct: 820 ATRCMMQAVYFCSGMDSD FHHYGLAS PI YTHFTS P IRRYAD 1 1 VHRL LAVA I GAD CT 876 

Query: 719 YSEQPDVEPDTLQKQADHC NDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGV 775 

YE D K+DC NR ++QS+ +KG+EA++V 

Sbjct: 877 YPELTDK HKLSDICKNLNFRHKMAQYAQRASVAFHTQLFFKSKGIVSEEAYILFV 931 

Query: 776 LNQAFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVITIF 835 

A VL+- ++G++ ++ +K KP LT E L E T V +F 

Sbjct: 932 RKNAI WLI PKYGLEGTVFFE EKDKPKPRLTYDDEIPSLRIEGT- - VFHVF 980 

Query: 83 6 SLVDWLQAEATALKYSAI 854 

V V + +++ L++ I 
Sbjct: 981 DKVKVKI TLDS SNLQHQKI 999 



Score = 51.6 bits (122), Expect = 5e-05 

Identities = 30/99 (30%), Positives = 54/99 (54%), Gaps = 5/99 (5%) 

Que r y : 4 9 I FETYMSKEDVS EGLKRGTL I QGVLR INPKKFHEAF I PS PDGDRD I F IDG WARNRA 105 

IF ++ + +G+K G+ +QG R + + + EA + + +++I I G+ NRA 

Sbjct: 294 IFSEHLPLSKLQQGIKSGSYLQGTFRASRENYLEATVWIHGDKEDEKEILIQGLKHLNRA 353 

Query: 106 LNGDLVWKLLPEDQWKAVKPESNDKE I EATYEAD I PEE 144 

++ D+V V+LLP+ QW V P S E E D+ + + 

Sbjct: 354 I HED I VAVELL P KS QW - - VAP S S WLHDEGQNEDDVEKD 390 

>gi | 19923416 | ref | NP_055768 . 2 | mitotic control protein dis3 homolog [Homo 
sapiens] 

gi | 17225572 | gb | AAL37479 . 1 | AF330044_1 KIAA1008 protein [Homo sapiens] 
Length = 958 

Score = 330 bits (845) , Expect = 7e-89 

Identities - 229/674 (33%), Positives = 344/674 (51%), Gaps = 68/674 (10%) 



Query: 192 SISTPDRGKEDSSTPV>IKDENTPIPQDTRGLSEKSLQKSAKVVYILEK^SRAATGILKL 251 

S+ D G+ + ++ + +SEK L+ + +W -1+ K++ R G-fL 

Sbjct: 307 SWLHDEGQNEEDVEKEEETERMLKT AVSEKMLKPTGRWGII -KRNWRPYCGMLSK 362 



Query: 252 LADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRII DWK 307 

K S + LF+P+D R+PRI + + A+TL RII W 

Sbjct: 363 SDIKES RRHLFTPADKRIPRIRIETRQ- - - ASTLEGRRI IVAIDGWP 406 

Query: 3 08 EDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVG 367 

+ + G ++LG GE E ETE +L E+ V FS VL LP+ +PW+I ++ 
Sbjct: 407 RNSRYPNGHFVRNLGDVGEKETETEVLLLEHDVPHQPFSQAVLSFLPK-MPWSITEKDMK 465 

Query: 368 KRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAE 427 

R DLR CI ++DP D+DDAL CR L +G EVGVHIADVS+F+ G++LD+ +A 
Sbjct: 466 NREDLRHLCICSVDPPGCTDIDDALHCRELENGNLEVGVHIADVSHFIRPGNALDQESAR 525 

Query: 428 RATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSC 487 

R T+VYL +K + M+P LL LCSL D+L FS IW++ +IL+ F +++I ■ S 

Sbjct: 526 RGTTVYLCEKRIDMVPELLSSNLCSLKCDTORLAFSCIWEMNHNAEILKTKFTKSVINSK 585 

Query : 4 88 TKLS YDHAQSMI ENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHS IAKQLRRQRFVDGAL 54 7 

L+Y AQ I + + + + + ++ L+ +AK L+ ++R GAL 

Sbjct: 586 ASLTYAEAQLRIDSANMN DDITTSLRGLNKLAKILKKRRIEKGAL 630 

Query: 548 RLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRR 607 

L ++ F +D ET P E R++N +VEEFMLLAN++VA KI F E ALLR+ 

Sbjct: 631 TLSSPEVRFHMDSETHDPIDLQTKELRETNSMVF1EFMLLANISVAKKIHEEFSEHALLRK 690 

Query: 608 HPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPM 667 

HP P LV+ L + +A +L +SL + L +L + +R M 

Sbjct: 691 H PAP P PSNYE I L VKAARS RNLE I KTDTAKS LA.E SLDQAES PTF P YL - - NTLLR I LATRCM 748 

Query: 668 QlVIALYFCSGMLQDQEQFRHYALKrVPLYTHFTSPIRRFADVIVHRLLAAALG YSEQP 723 

A+YFCSGM D F HY L P+YTHFTSPIRR+ADVIVHRLLA A+G Y E 
Sbjct: 749 MQAVYFCSGMDND FHH YGLAS P I YTH FT S P I RR YAD VI VHRLLAVA I GAD CT Y PELT 805 

Query: 724 D VE PDTLQKQADHC NDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGVLNQAF 780 

D K AD C NR ++QS+ +KG+EA++V A 

Sbjct: 806 DK HKLAD I CKNLNFRHKMAQ YAQRAS VAFHTQLFFKS KGI VS EEA Y I LFVRKNAI 860 

Query: 781 DVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVITIFSLVDV 84 0 

VL+ ++G++ + F + KP L+++ + + V +F V V 

Sbjct: 861 WL I PKYGLEGTV FFEEKDKPNPQLIYDDEIPSLKIEDTVFHVFDKVKV 909 

Query: 841 VLQAEATALKYSAI 8 54 

+ +++ L++ I 
Sbjct: 910 KIMLDSSNLQHQKI 923 



Score = 50.1 bits (118), Expect = 2e-04 

Identities = 27/83 (32%) , Positives = 48/83 (57%) , Gaps = 5/83 (6%) 

Query : 49 I FETYMS KEDVSEGLKRGTL I QGVLR INPKKFHEAF I PSPDGDRDIFIDGWARNRA 105 

IF ++ + +G+K GT +QG R + + + EA + + +++I + G+ NRA 

Sbjct: 227 IFSEHLPLSKLQQGIKSGTYLQGTFRASRENYLEATVWIHGDSEENKEIILQGLKHLNRA 286 

Query: 106 LNGDLVWKLLPEDQWKAVKPES 12 8 

++ D+V V+LLP+ QW VPS 
Sbjct: 28 7 VHEDIVAVELLPKSQW- - VAPSS 3 07 

>gi | 6324552 | ref |]NTP_014621 . 1 1 Possible component of RCCl-Ran pathway; Dis3p 
[Saccharomyces 

cerevisiae] 

gi | 2500547 | sp | Q08162 | RR44_YEAST Exosome complex exonuclease RRP44 
(Ribosomal RNA processing protein 
44) {Protein DIS3) 

gi | 2131990 | pir | | S66704 hypothetical protein YOL021C - yeast (Saccharomyces 
cerevisiae) 

gi 1 1419801 1 emb | CAA99021 . 1 1 ORF YOL021c [Saccharomyces cerevisiae] 



gi| 1754617 |dbj | BAA11176 . 1 | DIS3 protein [Saccharomyces cerevisiae] 
Length = 10 01 

Score = 321 bits (823), Expect = 3e-86 

Identities = 253/761 (33%), Positives - 364/761 (47%), Gaps = 99/761 (13%) 

Query: 50 FETYMSKEDVSEGLKKGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGVVARlSrRALNGD 109 

FYS V GLK G L QG + + I + FE + P + + IG NRA NGD 
Sbjct: 255 FPEYYSTARVMGGLKNGVLYQGNIQISEYNFLEGSVSLPRFSKPVLIVGQKNLNRAFNGD 314 

Query: 110 LWVKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQF 169 

V+V+LLP+ +WKA D E H + + PD IEA 

Sbjct: 315 QVIVELLPQSEWKAPSSIVLDSE HFDVNDN PD--IEAG- 350 

Query: 170 DDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQK 229 

DD D+ + NT+ + D ++L + KD I Q. + K +Q 

Sbjct: 351 DDDDNNES S SNTTVI SDKQRRL LAKD- -AMIAQRS KKIQP 388 

Query: 23 0 SAKVVYILEKKHSRAATGILKLLADKN 2 88 

+AKWYI + + + R G LA + D P ++V L D C 

Sbjct: 389 TAKWYI-QRRSWRQYVG QLAPSSVD PQS S STQNVFVI LMDKCLPKVR 435 

Query: 289 TRPKDFANTL F I CR 1 1 DWKEDCNFALGQLAKSLGQAGE I E PETEGI LTE YGVDFSDF 345 

R+AL + IW +LG +LG + ETE +L E+ V++ F 

Sbjct: 436 IRTRRAAELLDKRIVISIDSWPTTHKYPLGHFVRDLGTIESAQAETEALLLEHDVEYRPF 495 

Query: 346 SSEVLECLP-QSLPWTIP PDEVG KRRDLRKDCIFTIDPSTARDLDDALA 3 93 

S +VLECLP + W P P+ V KR+DLR .1 +IDP D+DDAL 

Sbjct: 496 SKKVLECLPAEGHDWKAPTKLDDPEAVSKDPLLTKRKDLRDKLICSIDPPGCVDIDDALH 555 

Query: 3 94 CRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSL 4 53 

+ + L +G + EVGVHI ADV+ + FV G++LD A R TSVYLV K + MLP LL +LCSL 
Sbjct: 556 AKKLPNGNWEVGVHIADVTHFVKPGTALDAEGAARGTSVYLVDKRIDMLPMLLGTDLCSL 615 

Query: 454 NPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXX 513 

P D+ FSVIW+L 1+ F +++IRS SY+ AQ I++ 
Sbjct: 616 KPYVDRFAFSVIWELDDSANIVNVNFMKSVIRSREAFSYEQAQLRIDDKTQN 667 

Query: 514 XXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEY 573 

+E+ + L ++ +L+ ++R GAL L ++ +D ET P I + 
Sbjct: 668 D E L TMGMRAL L KL S VKLKQ KRL EAGALNLAS P E VKVHMD S E T S D PN E VE I KKL 720 

Query: 574 RDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCD-QMGLPMDV 632 

+N LVEEFMLLAN++VA KI+ FP+ A+LRRH P+ LE+++ + + 

Sbjct: 721 LATNSLVEEFMLLANISVARKIYDAFPQTAMLRRHAAPPSTNFEILNEMLNTRKNMSISL 780 

Query: 633 SSAGALNKSLTKTFG-DDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNV 691 

S+ AL SL + +D Y ++ M +R M A YF SG FRHY L V 

Sbjct: 781 ESSKALADSLDRCVDPEDPYF NTLVRIMSTRCMMAAQYFYSGAYS - YPDFRHYGLAV 836 

Query: 692 PLYTHFTS P I RRFADVI VHRLLAAALGYS EQPD VEPD - - TLQKQADHCNDRRMASKRVQE 749 

+YTHFTSPIRR+ DV+ HR LA A+GY D + + N + + + 

Sbjct: 837 D I YTHFTS P IRRYCD WAHRQLAGAIGYE PLS LTHRDKNKMDMI CRN INRKHRNAQFAGR 896 

Query: 750 LSIGLFFAVLVKESGPLESEAMVMGVLNQAFDVLVLRFGVQ 79 0 

SI + + + + + E+ V+ V N VLV +FGV+ 
Sbjct: 897 AS I E Y YVGQ VMRNNE S T E T - G YV I KVFNNG I WL VP KFG VE 936 

>gi|2129733l|gb|EAA09476.l| ebiP3704 [Anopheles gambiae str. PEST] 
Length =9 65 

Score = 313 bits (803), Expect = 6e-84 

Identities = 228/691 (32%), Positives = 352/691 (50%), Gaps = 62/691 (8%) 



Query: 180 NT S GL VDG VKKL SIS T PDRGKED S S T P VMKDENT P I PQDTRGLSEKSLQK- SAKV 233 



N + VDG P+ + S V+-DE T ■ P T + ++ ++ +AKV 

Sbjct: 280 NLNRAVDGDTVAIEMLPESEWKAPSDWLVDEQTDPGDMVEPDPTFSVKPQAEREPTAKV 339 

Query: 234 VYILEKKHSRAATGILKIjLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKD 293 

V I+++K R GIL + S +F P++ + + PRI + + 
Sbjct: 340 VGIIKRKW-RQYCGILLPSHIQGS TRHI FVPAERKI PR IR I ETRQ 383 

Query: 294 FANTLFICRI I DWKEDCNFALGQLAKS LGQAGE I E PE TEG I L TE YGVD FSD FS S E V 3 49 

A TL RII W + G ++LG G E E E IL E+ V S FS +V 

Sbjct: 384 -AATLLSQRIIVAIDQWPRHSRYPQGHFVRALGPIGSKETENEVILLEHDVPHSRFSEDV 442 

Query: 350 LECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIA 409 

L CLP+ LPWTI ++ +R DLR + ++DP D+DDAL RRL +G EVGVHIA 
Sbjct: 443 LACLPE-LPWTITAADLQRRVDLRDITVCSVDPPGCTDIDDALHARRLPNGNIEVGVHIA 501 

Query: 410 DVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLT 469 

DV +F+ G++LD+ AA RAT+VYLV K + M+P LL LCSL ++ FS IW+L 
Sbjct: 502 DVGHFIRPGTALDREAASRATTVYLVDKRID3VIVPGLLSSNLCSLRGGEERFAFSCIWELD 561 

Query: 470 PEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLN 529 

EI + +++I+S + L+Y+ AQ +I++ +V ++ 

Sbjct: 562 DEANIRNTRYHKSVIKSKSALTYEEAQIIIDD-- - - AKQTNDVATSLRL 606 

Query: 530 LHSIAKQLRRQRFVDGALRLDQLKXAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANM 589 

L+ +AK L+ ++R GAL L ++ F +D ET P + ++N +VEEFMLLAN+ 

Sbjct: 607 LNRIAKILKKRRTEKGALVLASPEIRFQVDSETHDPIDVKAKQLLETNSMVEEFMLLANV 666 

Query: 590 AVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDD 649 

+VA KI + FPE A+LRRHP P LV+ + G + +S L SL K D 

Sbjct: 667 S VAE KI EQEF PE CAMLRRHPC P PQAN YE PLVKAAEHQGFE I LTTS GKELAT S LDKAVKPD 726 

Query: 650 KYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTS P I RRFADVI V 709 

+L + +R M "A+YF SG +Q +++F HY L P+YTHFTSPIRR+AD+IV 
Sbjct: 727 NPYF - - NTMLRI LATRCMMQAVYFI SGTVQ - RDEFFHYGLAAP I YTHFTS P IRRYAD I IV 783 

Query: 710 HRLLAAALGYSEQPDVEPDTLQKQADH- -CND RRMASKRVQELSIGLFFAVLVKESG 764 

HRLLAA +G P+ L K+ + CN+ R ' ++ S+ L + ++ 

Sbjct: 784 HRL LAAC I GAD S TYPELLDKKVNSNLCNNLNYRNRMAQYAGRASVALHTHLFFRKRS 840 

Query: 765 PLE S EAMVMGVLNQAFDVLVLRFGVQKRI YCNALALRS YS FQKVGKKPELTLVWE PDDLE 824 

+ +A ++ + A +LV ++G + IY R+ K G + V++ + 

Sbjct: 841 E-DEQAYILFIRKNALQILVPKYGFEGTIYVTG RNNEEIKTGVR- . FVYDEEQQT 892 

Query: 825 EEPTQQVITIFSLVDWLQAEATALKYSAIL 855 

+ Q V F V V L ++T + + + + + 
Sbjct: 893 QRCGQ VVFRAFDP VI VRLSLDSTNVQHEKLV 923 



Score = 53.1 bits (126), Expect = 2e-05 
Identities = 23/75 (30%) , Positives = 46/75 (61%) 

Query : 49 IFETYMSKEDVSEGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNG 108 

+F + + + + + EG+K G L+QG + + + E-/-+ ++ + IG + NRA++G 

Sbjct: 228 LFPSHLTMLQ I HEGI KGGKLMQGGFLASRENYLEGYVRVEG I EKAVL I QGRMNLNRAVDG 287 

Query: 10 9 DLVWKLLPEDQWKA 12 3 

D- V +++LPE +WKA 
Sbjct: 288 DTVAIEMLPESEWKA 302 

>gi 1 14250908 | emb | CAC3 9259 . 1 1 Rrp44p homologue t Trypanosoma brucei] 
Length = 972 

Score = 313 bits (801) , Expect = le-83 

Identities = 187/536 (34%), Positives = 289/536 (53%), Gaps = 35/536 (6%) 



Query: 265 LFSPSDHRVPRXYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQA 324 

LF P D+R+PRI + ' D + I DW E +F +G + LG 

Sbjct: 396 LFQPKDNRI PRIRIT TAHLGDLKDKRLSVIIDDWGEHSSFPVGHYVEVLGTI 447 

Query: 325 GEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPST 384 

G+ + E + IL E + DFS V +CLP+ W + +E+G R DLR C+ ++DP 
Sbjct: 448 GDK^TEAKVILLENDIPHYDFSEAVYDCLPKG-EWNVTEEELGMRLDLRDLCWSVDPLG 506 

Query: 3 85 ARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPR 444 

RD+DDAL CRR+ EVGVHIADV+ + F+ EG+++D+ AA+R+TSVYLV + + MLP+ 

Sbjct: 507 CRDIDDALHCRRVNGNHLEVGVHIADVTHFLKEGTAMDEEAAKRSTSVYLVDRRINMLPQ 566 

Query: 445 LLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXX 504 

LL E LCS + D+ FS++W+ ++ E+FG+T+IRS h Y AQ MI++ 

Sbjct: 567 LLTENLCSIVADEDRYAFSIMWEFDENYSWREFFGKTVIRSRAALYYGDAQRMIDDPED 626 

Query: 505 XXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGL 564 

E ++ L +++ R++R DGAL L + F +D++ 
Sbjct: 627 E SEAAVSLRYLMQLSRHFRKRREKDGALFLCSQEFKFKVDNDHVN 671 

Query: 565 PQGCHI YE YRDSMK^VEEFMLLAJSTMAVAHKI FRTFPEQALLRRHPP PQTKMLSDL VEFC - 623 

P Y+ DSN ++EE+ML AN A A +++ +FP LLRRH P L E 

Sbjct: 672 PTDMQAYQTFDSNSMIEEWMLFANAAAARRVYASFPRWTLLRRHQAPAENAFDTLNEAIR 731 

Query: .624 DQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQ 683 

+ +G+ +D + + + ALN+SL K . R ++ + +R ++ A YF S + + + + 

Sbjct: 732 RKIGVKLDDTTSLALNESLEKCVDPSDPYFNR- -LIRTLVTRCLRQAQYFSSSEVS-KDE 788 

Query: 684 FRHYALNVPL YTHFTS P I RRFAD VI VHRLLAAALG YS EQPD VE PDTLQKQ - - ADHCNDRR 741 

F H+ L +P-t-YTHFTSPIRR+ADVTVHR LAAALG + + + + A + N R 

Sbjct: 789 FHHFGLAMP I YTHFTS P I RRYADVI VHRQ LAAALG I MDVS EAHMVS VKMEALASNLNYRH 848 

Query: 742 MASKRV QEL S I GLFFAVLVKE S GPLES EAMVMGVLNQAFDVLVLRFGVQKRI 793 

++R QLG+ +PE+V F VLV ++G + +1 

Sbjct: 849 EQAQRAGRDSQNLFTGFYLRNFANQEIPSEDGYWKLSETHVF- VLVPKYGQEGKI 903 

Score = 40.0 bits (92), Expect = 0.17 

Identities = 26/115 (22%) , Positives = 52/115 (45%) , Gaps = 9/115 (7%) 

Query: 49 I FETYMS KEDVS EGLKRGTL I QGVLR INPKKFHEAF I PS PDGD RDIFIDGWAR 102 

+F ++++ + G++ GT + +G LR++ F G + + G 

Sbjct: 235 LFSPHLAESALDLGVQNGTYLRGKXRVSETN CFFGEIRGQWKGHNFERVLLPGRTNL 291 

Query: 103 NRALNGDLVWKLLPEDQWKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKG 157 

NRA++GD+V V+LLP W+ ++ +E+ T EG + + +G 

Sbjct: 292 NRAIHGDIVTVELLPVASWRPLRGAKPTEEMNDTGAGGDDHENSGREGIGEESEG 346 

>gi | 19113445 | ref |NP_596653 . 1 | mitotic control protein dis3 
[Schizosaccharomyces pombe] 

sp| P3 72 02 |DIS3_SCHPO Mitotic control protein dis3 
pir| |A41944 mitotic control protein dis3 + - fission yeast 
(Schizosaccharomyces 
pombe) 

gi | 173381 (gb| AAA35302 . 1 | mitotic control protein 
gi I 3650393 | ernb | CAA21102 . 1 1 mitotic control protein dis3 
[Schizosaccharomyces pombe] 
Length = 97 0 

Score = 301 bits (770) , Expect = 4e-80 

Identities =202/594 (34%), Positives = 301/594 (50%), Gaps = 43/594 (7%) 

Query: 218 DTRGLSEKSLQKSAKWYILEKKHSRAATGILKLLADKNSDLFKKYALFSPSDHRVPRIY 277 

D + + + + + +AKW IL++ + ++ + + L -f-P D RVP+I 

Sbjct: 348 DLEL I TKRNAHPTAKWGI LKRNWRP YVGHVDNAT I AQS KGGS QQT VLLTPMDRRVPKI R 407 



gi | 585053 
gij 283075 



Query: 278 VPLKDCPQDFMTRPKDFANTLF I CRI I DWKEDCNFALGQLAKS LGQAGE I E PETEGI LTE 337 

+ P+R + I W +G+ LG+ E ETE +L E 

Sbjct: 408 FRTRQAPRLVGRR 1 WAI DLWDAS SRYPEGHFVRDLGEMETKEAETEALLLE 459 

Query: 338 YGVDFSDFSSEVLECLPQS-LPWTIPPDEV GKRRDLRKDCIFTIDPSTARDLDDAL 392 

Y V F VL+CLP+ W +P D+ R+D R I +IDP +D+DDAL 

Sbjct: 460 YDVQHRPFPKAVLDCLPEEGHNWKVPADKTHPLWKNRKDFRDKLICSIDPPGCQDIDDAL 519 

Query: 393 ACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCS 452 

L +G +EVGVHIADV+ + FV +S+D AA R T+VYLV K + MLP LL +LCS 
Sbjct: 520 HACVLPNGNYEVGVHIADVTHFVKPNTSMDSEAASRGTTVYLVDKRIDMLPMLLGTDLCS 579 

Query: 453 LNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXX 512 

LP ++ FS IW++ I++ F +++I S SY AQ+ I++ 
Sbjct: 580 LRPYVERFAFSCIWEMDENANIIKVHFTKSVIASKEAFSYADAQARIDDQKMQ 632 

Query: 513 XXXXXHSVEEWQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYE 572 

+ + Q + L ++K L+ ++R +GAL L ++ D+ET P I + 
Sbjct: 633 D PLTQGMRVLLKLS KI LKQKRMDEGALNLAS PEVR I QTDNETS DPMDVE I KQ 684 

Query; 573 YRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRH-PPPQTKM- -LSDLVEFCDQMGLP 629 

+ +N LVEEFMLLAN++VA KI+ FP+ A+LRRH PP T L D++ C ML 
Sbjct: 685 LLETNSLVEEFMLLANISVAQKIYDAFPQTAVLRRHAAPPLTNFDSLQDILRVCKGMHLK 744 

Query: 630 MDVSSAGALNKSLTKTFGD- -DKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHY 687 

D S KSL K+ + D +L + +R M A YFCSG + FRHY 

Sbjct: 745 CDTS KSLAKSLDECVDPKEPYFNTLLRILTTRCMLSAEYFCSGTFAPPD-FRHY 797 

Query: 688 ALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSE-QPDV-EPDTLQKQADHCNDRRMASK 745 

L P+YTHFTSPIRR+ADV+ HR LAAA+ Y P++ L++NR ++ 
Sbjct: 798 GLAS P I YTHFT S P I RRYAD VLAHRQ LAAAI D YE TINPSLSDKSRLIEI CNG I N YRHRMAQ 857 

Query: 746 RVQELSIGLFFAVLVKESGPLESEAMVMGVLNQAFDVLVLRFGVQKRIYCNALA 799 

SI + +K G E +A V+ V F V + RFG++ +Y +L+ 
Sbjct: 858 MAGRASIEYYVGQALK-GGVAEEDAYVIKVFKNGFWFIARFGLEGIVYTKSLS 910 



Score = 49.7 bits (117), Expect = 2e-04 
Identities = 23/84 (27%) , Positives = 46/84 (54%) 

Query: 4 9 I F ET YMS KED VS EGL KRGTL I QGVLR INP KKFHEAF IPS PDGDRD I F I DG WARNRALNG 108 

+ +E + S + +K G + +G++ 1+ + E + P ++++G NRA+ G 
Sbjct: 247 VYELHWSMS RLLAC I KNGEVHKGL IN I STYNYLEGS VWPGYNKP VLVS GRENLNRAVQG 306 

Query: 109 DLWVKLLPEDQWKAVKPESNDKE 132 

D+V +++LP+DQWK E D + 
Sbjct: 3 07 DIVCIQILPQDQWKTEAEEIADDD 330 

>gi | 15021874 | dbj |BAB62212 . 1 | hypothetical protein [Macaca f ascicularis] 
Length =24 9 

Score = 291 bits (745) , Expect = 3e-77 

Identities = 156/247 (63%), Positives = 171/247 (69%), Gaps = 2/247 (0%) 

Query; 1 MNHPDYKLNLRXXXXXXXXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXIFETYMSKEDVS 60 

M+HPDY+ +NLR D IFETYM KEDVS 

Sbjct : 1 MSHPDYRMNLRPLGTPRGVSTVAGPHGIGASPGDKKSKNKSTRGKKKSIFETYMFKEDVS 60 

Query: 61 EGLKRGTLIQGVLRINPKKFHEAFI PS PDGDRD IFIDGVVARNRALNGDLVVVKLLPEDQ 120 

EGLKRGTL I QGVLRI NPKKFHEAF IPS PDGDRD I F I DGWARNRALNGD LVWKLL PE + Q 
Sbjct; 61 EGLKRGTLIQGVLRINPKKFHEAFIPSPDGDRDIFIDGVVARNRALNGDLVVVKLLPEEQ 120 

Query; 121 WKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWS -GPDVIIEAQFDDSDSEDRHG 179 

WK VKPE+NDKE EA YE+DIPEE CG H QQS K ++ PDVIIEAQFD SDSED HG 



Ill 



Sbjct: 121 WKWKPENNDKETEAAYESDIPEELCGRHLPQQSLKSYNDSPDVIIEAQFDGSDSEDGHG 180 

Query: 180 NTSG-LVDGVKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKWYILE 238 

T LVD VKKLS+ ++G+E PV KDE T I QDTR L EKSLQ+SAKV+ 

Sbjct: 181 ITQNVLVDDVKKLSVCVSEKGREHGDAPVTKDETTCISQDTRALPEKSLQRSAKVIAYRF 240 

Query: 23 9 KKHSRAA 24 5 

H + A 

Sbjct: 241 SSHVQMA 247 

>gi | 19115966 |ref |NP_588616.1 [ hypothetical protein MGC4562 [Homo sapiens] 

gi j 18314381 | gb | AAH22089 . 1 | AAH22089 Similar to mitotic control protein dis3 
homolog [Homo sapiens] 
Length = 971 

Score = 284 bits {121), Expect = 4e-75 

Identities = 190/586 (32%), Positives - 297/586 (50%), Gaps = 50/586 (8%) 

Query: 265 LFSPSDHRVPRIYVPLKDCP- -QDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLG 322 

L +P D+R+P+I + + QDF + RI W+ + G + LG 

Sbjct: 283 LVTPWDYRIPKIRISTQQAETLQDFRV , WRIDS WESTS VYPNGHFVRVLG 332 

Query: 323 QAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSL PWT I PPDEVGKRRDLRKD - C I F 378 

+ G++E E IL E + FS + +P + PW + P+E KR+DLRK +F 

Sbjct: 333 RI GDLEGEI ATI LVENS I S VI PFS EAQMCEMPVNTPES PWKVS PEEEQKRKDLRKSHLVF 392 

Query: 379 TIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQfCV 438 

+ IDP D+DD L+ R L +G E+GVHIADV+ + FV S +D A RAT+ YL + 
Sbjct: 3 93 SIDPKGCEDVDDTLSVRTLNNGISTLELGVHIADVTHFVAPNSYIDIEARTRATTYYLADRR 4 52 

Query: 439 VPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEG-KILEEWFGRTIIRSCTKLSYDHAQS 497 

MLP +L +LCSL D+ S++W+L +1 + W+GRTIIRS KL Y+ AQ 

Sbjct: 453 YDMLPS VLS ADLC S LLGGVDR YAVS I MWELDKAS YE I KKVW YGRT I 1 RSA YKLF YEAAQE 512 

Query: 498 MIENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLK 553 

+++ S +EE+ A+ L IA+ +R +R GAL L+ + + 

Sbjct: 513 LLDGNLSWDDIPEFKDLDEKSRQAKLEELVWAIGKLTDIARHVRAKRDGCGALELEGVE 572 

Query: 554 LAFTLD HETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRH 608 

+ LD H+ Q + + E V E M+LAN VA KI+ + FP QALLR+H 

Sbjct: 573 VCVQLDDKKNIHDLI PKQPLEVHE TVAECMILANHWVAKKIWESFPHQALLRQH 626 

Query: 609 PPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQ 668 

PPP + S+L E G +D S L SL +D + +L +M ++ M 

Sbjct: 627 PPPHQEFFSELRECAKAKGFFIDTRSNKTLADSLDN- -ANDPHDPIVNRLLRSMATQAMS 684 

Query: 669 MALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPD 728 

ALYF +G +E+F HY L + YTHFTSPIRR-f +D++VHRLL AA+ ++ +++ + 

Sbjct: 685 NAL YFS TGS CA - EEE FHHYGLALDKYTHFTS P I RRYSDI WHRLLMAAI S KDKKME I KGN 743 

Query: 729 TLQKQADHCNDRRMAS KRVQELS I GLFFAVLVKES GPLE SEAMVMGVLNQ 778 

L++ H N+R A++ Q+ S LF + K+ P S+ ++ + 

Sbjct: 744 LFSNKDLEELCRHINNRNQAAQHSQKQSTELFQCMYFKDKDPATEERCISDGVIYSIRTN 803 

Query: 779 AFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLE 824 

+ + RFG++ Y L++ + P+ W+P L+ 

Sbjct: 804 GVLLFIPRFGIKGAAY LKNKDGLVISCGPDSCSEWKPGSLQ 844 



Score = 59.3 bits (142), Expect = 3e-07 

Identities = 33/82 (40%), Positives = 51/82 (62%), Gaps = 8/82 (9%) 

Query : 50 FET YMS KED VS EGL KRGTL I QG VLR-INPKKFH - EAF I PSPDGD- -RDIFIDGWA 101 

+ ++ E + G+K G IQG+L +M + EAF+ S D D DI I G+ A 

Sbjct: 142 YPEHLPLE VLEAG I KSGR Y I QG I LNVNKHRAQ I EAFVRLQGAS S KDSDL VSD I L I HGMKA 201 



Query: 102 RNRALNGDLVWKLLPEDQWKA 12 3 

RNR+++GD+VW+LLP+ ++WK 
Sbjct: 202 RNRSIHGDWWELLPKNEWKG 223 



>gi | 18 916779 | dbj | BAB85541 . 1 | KIAA1955 protein [Homo sapiens] 
Length = ' 94 7 

Score = 284 bits (726) , Expect = 5e-75 

Identities = 190/586 (32%), Positives = 297/586 (50%), Gaps = 50/586 (8%) 

Query: 265 LFSPSDHRVPRI YVPLKDCP- - QDFMTRPKDFANTLFICRI IDWKEDCNFALGQLAKSLG 322 

L +P D+R+P+I + + QDF + RI W+ + G + LG 

Sbjct: 259 LVTPWDYRI PKI RI STQQAETLQDFRV WRIDSWESTSVYPNGHFVRVLG 308 

Query: 323 QAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSL PWT I P PDEVGKRRDLRKD - CI F 3 78 

+ G++E E IL E 4 FS + +P + PW + P+E KR+DLRK +F 

Sbjct: 309 RIGDLEGEIATILVENSISVIPFSEAQMCEMPVNTPESPWKVSPEEEQKRKDLRKSHLVF 368 

.Query: 379 TIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKV 438 

+IDP D+DD L+ R L +G E+GVHIADV++FV S +D A RAT+ YL + 
Sbjct: 369 SID PKGCED VDDTLS VRTLNNGNLELGVH I ADVTHFVAPNS YI D I EARTRATTY YLADRR 428 

Query: 439 VPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEG-KILEEWFGRTIIRSCTKLSYDHAQS 497 

MLP +L + LCSL D+ S++W+L +1 + W+GRTIIRS KL Y+ AQ 

Sbjct: 429 YDMLPSVLSADLCSLLGGVDRYAVSIMWELDKASYEIKKVWYGRTIIRSAYKLFYEAAQE 488 

Query: 4 98 MIENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLK 553 

+ + + S +EE+ A+ L IA+ +R +R GAL L+ ++ 

Sbjct: 489 LLDGNLSWDDIPEFKDLDEKSRQAKLEELVWAIGKLTDIARHVRAKRDGCGALELEGVE 548 

Query: 554 LAFTLD HETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRH 608 

+ LD H+ Q ++E V E M+ LAN VA KI+ +FP QALLR+H 

Sbjct: 549 VCVQLDDKKNIHDLIPKQPLEVHE TVAECMILANHWVAKKIWESFPHQALLRQH 602 

Query: 6 09 PPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQ 668 

PPP + S + L E G +D S - L SL +D + +L +M ++ M 

Sbjct: 603 PPPHQEFFSELRECAKAKGFFIDTRSNKTLADSLDN- -ANDPHDPIVNRLLRSMATQAMS 660 

Query; 669 MALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPD 728 

ALYF +G +E+F HY L + YTHFTSPIRR++D++VHRLL AA+ ++ +++ + 
Sbjct: 661 NALYFSTGSCA-EEEFHHYGIJ\LDKYTHFTSPIRRYSDIVVHRLLMAAISKDKKMEIKGN 719 

Query: 729 TLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLE SEAMVMGVLNQ 778 

L++ H N+R A++ Q+ S LF + K+ P S+ ++ + 

Sbjct: 720 LFSNKDLEELCRHINNRNQAAQHSQKQSTELFQCMYFKDKDPATEERCISDGVIYSIRTN 779 

Query: 779 AFDVLVLRFGVQKRI YCNALALRS YS FQKVGKKPELTLWE PDDLE 824 

+ + RFG+ + Y L++ + P+ W+P L+ 

Sbjct: 780 GVLLFIPRFGIKGAAY LKNKDGLVISCGPDSCSEWKPGSLQ 820 



Score = 59.3 bits (142), Expect = 3e-07 

Identities = 33/82 (40%), Positives = 51/82 (62%), Gaps = 8/82 (9%) 

Query: 50 FETYMSKEDVSEGLKRGTLIQGVLRINPKKFH-EAFI PSPDGD- -RDIFIDGWA 101 

+ ++ E + G+K G IQG+L +N + EAF+ S D D DI I G+ A 

Sbjct: 118 YPEHLPLEVLEAGIKSGRYIQGILNVNKHRAQIEAFVRLQGASSKDSDLVSDILIHGMKA 177 

Query: 102 RNRALNGDLVWKLLPEDQWKA 123 

RNR+ + +GD+VW+LLP+++WK 
Sbjct: 178 RNRSIHGDVVWELLPKNEWKG 199 

>gi|27369724|ref |NP_766107.1| hypothetical protein 4932411M14 [Mus ■ 
musculus] 



gi | 26325912 | dbj | BAC26710 . l| unnamed protein product [Mus musculus] 
Length = 970 

Score = 283 bits (724), Expect = 9e-75 

Identities = 194/595 (32%), Positives = 300/595 (50%), Gaps = 50/595 (8%) 

Query: 265 LFSPSDHRVPRIYVPLK- -DCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLG 322 

L +P D+R+P+I + •+ + QDF + RI W+ + G + LG 

Sbjct: 283 LVTPWDYRIPKIRISTQQAEALQDFRV WR IDS WEAT S VY PNGHF VR VL G 332 

Query: 323 QAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSL PWT I P PD E VGKRRDLRKD - C I F 378 

+ G++E E IL E + FS + +P + PW + P E +R+DLR +F 

Sbjct: 333 RIGDLEGEIATILVENSISWPFSEAQMCEMPVNTPENPWKVSPKEEQERKDLRTTHLVF 392 

Query: 3 79 T I D P STARD LDDALACRRLTDGT FE VGVH I ADVS YF VPEGS S LDKVAAERAT S VYLVQKV 43 8 

+ IDP D+DD L+ R L +G E+GVHIADV++FV S +D A RAT+ YL + 
Sbjct: 3 93 SIDPKGCEDVDDTLSVRTLNNGNLELGVHIADVTHFVAPNSYIDVEARTRATTYYLADRR 4 52 

Query: 439 VPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEG-KILEEWFGRTIIRSCTKLSYDHAQS 497 

MLP +L +LCSL D+ SV+W+L +1 + W+GRTIIRS KL Y+ AQ 

Sbjct: 453 YDML P S I L S ADLCS LLGGVDRYAVS VMWE LDKTS YE I KKVWYGRT 1 1 RS AYKLFYEAAQE 512 

Query: 498 MIENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLK 553 

+ + + S +EE+ A+ L IA+ +R +R GAL L+ + + 

Sbjct: 513 LLDGNFSIVDDIPELKALDKQSQQAKLEELVWAIGKLTDIARHIRAKRDRCGALELEGVE 572 

Query; 554 LAFTLDHETGL PQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRH 608 

+ LD + + Q ++E V E M+LAN VA KI+ +FP QALLR+H 

Sbjct: 573 VRVQLDDKKNIRDLIPKQPLEVHE TVAECMILANHWVAKKIWESFPHQALLRQH 626 

Query: 609 PPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQ 668 

PPP - + S+L E G +D S L SL D K L K +L +M ++ M 

Sbjct: 627 PPPHQEFFSELRECAKAKGFFIDTRSNKTLADSLDSA-NDPKDPLWK-LLRSMATQAMS 684 

Query: 669 MALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPD 728 

ALYF +G + E+F HY L + YTHFTSPIRR++D++VHRLL AA+ ++ +++ + 
Sbjct: 685 NALYFSTGSCA-EEEFHHYGLALDKYTHFTSPIRRYSDIAAmRLLKAAISKDKKMEIKEN 743 

Query: 729 T LQ KQADH CND RRMAS KR VQ E L S I GL F F AVL VKE S G P LE S E AMVMGVLNQ 778 

L++ H N+R A++R Q+ S LF + K+ E ++ + 

Sbjct: 744 LFSNKNLEELCRHINNRNRAAQRSQKQSTELFQCMYFKDRDAETEERCIADGVIYSIRTN 803 

Query: 779 AFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVIT 83 3 

V + RFG+ + Y L++ + PE + W+P L+ + + T 

Sbjct: 804 GVL VF I PRFG I KGAA Y LKNKDSLVISCGPEGSSEWKPGSLQRSQNKIIST 853 



Score = 54.7 bits (130), Expect = 7e-06 

Identities = 33/103 (32%), Positives = 54/103 (52%),' Gaps = 8/103 (7%) 

Query: 50 FETYMSKEDVSEGLKRGTLIQGVLRINPKKFH - EAFI PSPDGD- -RDIFIDGWA 101 

+ ++ E + G+K G IQG+L +N + EAF+ S D DI I G A 

Sbjct: 142 Y P EHL PL E VL E AG I KS GR Y I QG I LNVNKHRAQ I E AF VRLHGAS S KD S GL VS DILI HGS KA 201 

Query: 102 RNRALNGDLVWKLLPEDQWKAVKPESNDKEIEATYEADIPEE 144 

RNR+ ++GD+VW++LP+ +WK + + + + P E 

Sbjct: 202 RNRS I HGD WWEML PKS EWKGRTAALGENDS DDKASGE S P S E 244 

>gi | 14250916 | emb | CAC39263 . 1 | putative ribonuclease II-like protein 
[Trypanosoma brucei] 

Length = 895 

Score = 278 bits (710), Expect = 4e-73 

Identities = 236/763 (30%), Positives = 352/763 (46%), Gaps = 109/763 (14%) 



Query: 


52 


Sbj ct : 


18 


Query: 


111 


Sbjct : 


78 


Query: 


166 


Sbjct : 


123 


Query: 


226 


Sbjct : 


159 


Query: 


284 


Sbjct : 


215 


Query : 


337 


Sbjct: 


271 


Query : 


393 


Sbjct : 


329 


Query : 


453 


Sbjct: 


388 


Query: 


513 


Sbjct': 


448 


Query : 


567 


Sbjct : 


508 


Query: 


627 


Sbj Ct : 


563 


Query : 


685 


Sbj Ct : 


622 


Query : 


731 


Sbjct : 


682 



++ + +K G ++ G LR+ + AF+ S D+ + G + NR. L+ D+ 



V V+LLP +QW+ V +P+ +DK+ T D L R+ I 
vavnT.T.PMROWT^nW.ciriRT.RPDGDDKDEFRTMRPD SERLPDGRR 1 122 



D+ +R L + +S P P 

•DTTMNNRKNREMWLAE MMSAPTEHNWHGKKP 158 



S V+ +LE+KH L A + ++ + + + F D +P I V +D 



R+ ++R+ I W E F ++ + G ' + T I + 



+ + 



DFS E C+P L IP E " K RRDLR+ + '+ +IDP+TARDLDDAL 



+ L G + VGVHIADVS + FV GS+LD+ RATSVYLV +V MLPR L EE CS 



L+P +DKL FS I++L GK+ EWFG+++IR+ +LSYD AQ +1+ 



+ + v +V +L +A +LR FGL ++F + + P 

448 GVTDRRELSQLKARVATSVKHLFDLASKLRAASFERGRLAFSTPEIGFHFEDISNPTHPI 507 



G +++ ++N LVEEFMLLAN+ VA KI + P+QA+LR HPPP+ L ++ 
508 GFNVHRQIEANWLVEEFMLLANLRVAQKIVQYLPDQAILRVHPPPKRVPFEQL KV 562 



+++ 



NKSL + + R E+ + M + +A Y +G ++ 



'HYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSE QPDVEPDTL 73 0 

HYAL + YTHFTS PIRR+AD+ 1 VHR L AL +E + L 



AD CN+ + A+ V E S+ LFF +K 
JSAEFFTSTYEVMNIADECNENKRAADSVSEASLKLFFCHYLK 724 

>gi | 15559519 | gb | AAH14124 . 1 1 AAH14124 Unknown (protein for IMAGE : 4561365) 

[Homo sapiens] 

Length =64 7 

Score = 270 bits (689) , Expect = le-70 

Identities = 178/534 (33%), Positives = 276/534 (51%), Gaps = 38/534 (7%) 

Query: 315 GQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSL PWTIPPDEVGKRRD 371 

G + LG+ G++E E IL E + FS + +P + PW + P+E KR+D 

Sbjct: 1 GTRVRVLGRIGDLEGEIATILVENSISVIPFSEAQMCEMPVNTPESPWKVSPEEEQKRKD 60 

Query: 372 LRKD-CIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERAT 430 

LRK +F+IDP D+DD L+ R L +G E+GVHIADV++FV S +D A RAT 



Sbjct : 61 LRKSHLVFSIDPKGCEDVDDTLSWTLNNGNLELGVHIADVTHFVAPNSYIDIEARTRAT 12 0 



Query: 431 SVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEG- KILEEWFGRTI IRSCTK 489 

+ YL + MLP +L +LCSL D+ S++W+L +1 + W+GRTIIRS K 

Sbjct: 121 TYYLADRRYDMLPSVLSADLCSLLGGVDRYAVSIMWELDKASYEiKKVWYGRTIIRSAYK 180 

Query: 490 LSYDHAQSMIENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHS IAKQLRRQRFVDG 545 

L Y+ AQ + + + S +EE+ A+ L IA+ +R +R G 

Sbjct: 181 LFYEAAQELLDGNLSWDDIPEFKDLGEKSRQAKLEELVWAIGKLTDIARHVRAKRDGCG 240 

Query: 546 ALRLDQLKLAFTLD HETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFP 600 

AL L+ + + + LD H+ Q ++E V E M+LAN VA KI+ +FP 

Sbjct: 241 ALELEGVEVCVQLDDKKNIHDLIPKQPLEVHE T VAE CM I LANH WVAKKI WE S F P 294 

Query: 601 EQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLT 660 

QALLR+HPPP + S+L E G +D S L SL +D + +L 

Sbjct: 295 HQALLRQHPPPHQEFFSELRECAKAKGFFIDTRSNKTLADSLDN--ANDPHDPIVNRLLR 352 

Query: 661 NM YS R PMQMAL Y F C S GML QDQ E Q F RH YALNVPL YTH FT S P I RRF AD V I VHRL LAAALGYS 720 

+M ++ M ALYF +G +E-J-F HY L + YTHFTSPIRR++D++VHRLL AA+ 
Sbjct: 353 SMATQAMSNALYFSTGS CA- EEEFHHYGLALDKYTHFTS P I RRYSD I WHRLLMAAI SKD 411 

Query: 721 EQPDVEPD TLQKQADHCNDRRMASKRVQELS IGLFFAVLVKESGPLE SEA 770 

+ + + + + + L+ + H N+R A++ Q+ S LF + K+ P S + 

Sbjct: 412 KKMEIKGNLFSNKDLEELCRHINNRNQAAQHSQKQSTELFQCMYFKDKDPATEERCISDG 471 

Query: 771 MVMGVLNQAFDVLVLRFGVQKRIYCNALALRSYSFQKVGKKPELTLVWEPDDLE 824 

+ + + + + RFG++ Y L++ + P+ W+P L+ ■ 

Sbjct: 472 VI YS I RTNGVLLF I PRFGI KGAAY LKNKDGLVISCGPDSCSEWKPGSLQ 520 

>gi | 23136616 \ gb | ZP_00118334 . 1 | hypothetical protein [Cytophaga 
hutchinsonii] 

Length = 750 

Score = 265 bits (676) , Expect = 3e-69 

Identities = 168/466 (36%) , Positives = 252/466 (54%) , Gaps = 35/466 (7%) 

Query: 315 GQ LAKSLGQAGE I EPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRK 374 

G++ + LG AGE EE IL E+G+ + +F + +V+ + Q + E+ KRRD+R 

Sbjct: 219 GKVTEVLGMAGENETEMHAILAEFGLPY-EFPNQVIKA-AQDITAETSEAEIKKRRDMRG 276 

Query: 3 75 DCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYL 434 

FTIDP A+D DDA+ + + L +G +E+GVHIADVS++V G + LDK A RATSVYL 
Sbjct: 277 TTTFTIDPEDAKDFDDAISFKYLDNGNWEIGVHIADVSHYVQPGDTLDKEAYRRATSVYL 336 

Query: 435 VQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDH 494 

V + VPMLP L ELCSL P DKLTFS I+++ + K+L+EWFGRTII S + SY+ 
Sbjct: 337 VDRCVPMLPERLSNELCSLRPNEDKLTFSAIFEIDADAKVLDEWFGRTIIHSTRRFSYEQ 396 

Query: 495 AQSMIENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHS IAKQLRRQRFVDGALRLDQLKL 554 

AQ +IE ++ + + L +++ AK++R QRF +GA+ + +++ 

Sbjct: 397 AQEVIETQQG DLVKELTILNTLAKKMRAQRFKEGAISFETVEV 439 

Query: 555 AFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIF RTFPEQALLRRHP 6 09 

F LD + G P +D++KL+EEFMLLAN VA +F + + R H 

Sbjct: 440 KFQLDAK-GKPLAWPKVRKDAHKilEEFMLLANKRVAEFVFNLRKGKGTSNTMVYRTHD 498 

Query: 610 PPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQM 669 

P + L+ L F + G +++ A+ K+L K D+ + VL ++ R M 

Sbjct: 499 APNPEKliASLATFAKRFGHKVELDDENAIAKNLNK-LSDEVEGKPEQOTLQSLAIRTMSK 557 

Query: 670 ALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDT 729 

A+Y E H+ L Y+HFTSPIRR+ DV+ HRLL L + D E 

Sbjct: 558 AIY SIEPDMHFGLAFKHYSHFTSPIRRYPDVMAHRLLQHYLDGGKSADKE- -Y 608 



Query: 73 0 LQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGV 775 

+ +H + + + + SI ++ S E E +V GV 

Sbjct: 609 YIEACEHSSAQEKLAAEAERSSIKFKQVEFMQSSIGKEFEGIVSGV 654 

>gi | 19173030 | ref | NP_597581 . 1 | similarity to DIS3 PROTEIN (RNASE. II FAMILY) 
t En c epha 1 i t o z oon 

cuniculi] 

gi[ 19168697 |emb|CAD26216.l| similarity to DIS3 PROTEIN (RNASE II FAMILY) 
[Encephalitozoon 

cuniculi] 
Length = 83 5 

Score = 260 bits (665) , Expect = 6e-68 

Identities - 161/460 (35%), Positives = 240/460 (52%), Gaps = 64/460 (13%) 

Query: 265 LFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQA 324 

L P D R+P I + N I W+ N+ G + LG 

Sbjct: 288 LVIPIDRRIPAIRIR TSQAEELMNKRLCVEIDGWERTSNYPSGHYYRRLGTL 339 

Query: 325 GEIEPETEGILTEYGVDFSDFS-SEVLECLPQSLPWTIPP DEV- -GKRRDLR 373 

G+ E E +L G+ + + + +E+L +PW + EV G R D R 

Sbjct: 34 0 GDRNAEMEAVLVANGI TYYNKNWAEI LR VPWMNEDIFGMEKAYNEVREGTREDFR 3 94 

Query: 3 74 KDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVY 43 3 

■+ I +IDP D+DDAL CR L +G +EVGVHIADV+Y+V +GS +DK+AA+R T++Y 
Sbjct: 395 ELSIVSIDPPDCEDIDDALHCRLLPNGNWEVGVHIADVTYYVTKGSEIDKIAADRGTTIY 454 

Query: 434 LVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYD 493 

L + + MLP L +LCSL D+ FSV+W+++ + + + + F R++IRS SY+ 
Sbjct: 455 LPEWRIDMLPPALSTDLCSLVAGBCDRAAFSWWEMSSDVRWRTHFCRSLIRSKRSFSYN 514 

Query: 4 94 HAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLK 553 

A ++ +EV + + + L I + + LR +RF G+L L + 

Sbjct: 515 EAYDAVQGLVDAG DEVSRSLRMLLEISRVLRSRRFAKGSLDLSTRQ 560 

Query: 554 LAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQT 613 

L F G + EY +N LVEEFM+LAN++VA 1+ P+ +LLR+HPPP 

Sbjct: 561 LVFR EGGFEMKEYFPTNFLVEEFMVLANISVASFIYHHHPDSSLLRKHPPPSV 613 

Query: 614 KMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALYF 673 

+GL +D SS AL++SL K G RK++ M R M A+Y 

Sbjct: 614 L DIGLDIDTSSPRALSESLEKMEG VRKDLAKRMLIRSMNQAVYV 657 

Query: 674 CSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLL 713 

SG + HY L P+YTHFTSPIRR+AD+IVHR+L 

Sbjct: 658 VSG ETPSLHHYGLATPIYTHFTSPIRRYADIIVHRIL 694 

>gi | 7493807 |pir | | T30524 protein phosphatase Ssdl homolog - yeast (Candida 
albicans) 

gi | 2459997 | gb | AAC83386 . 1 | protein phosphatase Ssdl homolog [Candida 
albicans] 

Length = 1262 
Score = 246 bits (628) , Expect = le-63 

Identities = 230/859 (26%), Positives - 391/859 (45%), Gaps = 86/859 (10%) 

Query: 4 9 I FET YMS KED VS EGLKRGTL I QG VLR INPKKFHEAF IPS PDG - - DRD I F I DG WARNRAL 106 

+F Y+ + + E + G L+ G LR+N K +A++ S DG D DIFI G RNRAL 
Sbjct : 33 6 LFAPYLPQSSLPELINEGRLVTGTLRVNKKNRSDAYV-STDGLLDADIFICGSKDRNRAL 3 94 

Query: 107 NGDLVWKLLPEDQ-WKAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVII 165 

GDLV V+LL D+ W++ K + K + P H+ S P+ 

Sbjct-: 395 EGDLVAVELLIVDEWESKKEKEEKKRRKDNTLHSRPLTDDIHN DATSAPNT-A 447 



Query: 166 EAQFDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPVMKDENTPIP-QDTRGLSE 224 



E + ED G+ G+ + RG + K+ + + + Q + E 

Sbjct : 448 EGSVTGTSKEDGAGSNEEETGGLAR RGSLKQRPTMKKNDDVEVEGQSLLLVEE 500 



Query: 225 KSLQKSAK WYILEKKHSRAATGILKLL ADK NSDLFK 261 

+ + K W + + + + + G L LL DK ' + 

Sbjct: 501 EEINDEIKPLYAGHWAWDRIPGQLFAGTLGLLRPAQAAQAARDKKNGKESTVQNPKAP 560 

Query : 2 62 KYALFSPSDHRVPRI YVPLKDCPQDFMTRPKDFANTLFICRI I D WKEDCNFALGQLAKS L 321 

K F P+D +VP I +P + P+DF+ + +A+ LF+ I W G L +L 

Sbjct : 561 KIVWFKPTDKKVPLIAIPTEQAPKDFVENHEKYADRLFVASIKRWPITSLHPFGTLVSNL 620 

Query: 322 GQAGEIEPETEGILTEYGVDFSDFS S EVLECLPQSLPWT I PPDEVGKRRDLRKDC I 3 77 

G E E + IL + ++ +++ LP P E +R + D I 

Sbjct : 621 GPIDSPETEIDSILRDNNFLCDEYPDDDNDDIVSWAYDLPSIEPEFENTQREEYLNDYI 680 

Query: 3 78 FTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQK 43 7 

+D AL +R+. + + E+G H+AD++YF+ GSSLD+ + +R++SV+L QK 
Sbjct: 681 I AFT - QNGEF VDHALHVKR I SNTKI ELGFHVAD I A YF I KPGS S LDRKS KKRS S S VFLPQK '739 

Query: 43 8 WPMLPRLLCEELCSLNPMTDKLTFSVIWKL-TPEGKILEEWFGRTIIRSCTKLSYDHAQ 4 96 

V + P+ + ++ S L SV++++ T ++ + + ++I ++YD 

Sbjct : 740 TVNLFPKQV-NKIVSFKENEKNLAVSWFEIDTSNFEVEDLYIHESVIIPKQLVTYDAFD 798 

Query: 497 SMIENXXXXXXXXXXXXXXXXHSVEEVHQA VLNLHS I AKQLRRQRFVDGALRLD - Q 551 

+ + + SV+ + . A V IAK+ RR R + +L + 

Sbjct: 799 TILSG . QSVDSISSATSDYVKTFSLIAKEFRRHRLSNRSLGITPN 842 

Query : 552 LKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPP 611 

L L LD E + +I + + + ++ E N A+A K+ +QA+LRRHP P 

Sbjct : 843 LTLLDQLDDEK- VRLDLNT FKDSLAFDVT S E I SHKVNSAIAAKVHAGLGDQAI LRRHP LP 901 

Query : 612 QTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMAL 671 

+ + V +G +D + + + L S+ K DD + RK V T +Y + M 

Sbjct : 902 TLQKMETFVRKATSLGFKIDTTTSSTLQNSILKI - -DD- - PVKRKCVETLLY-KCMSRGR 956 

Query: 6 72 YFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQ 731 

Y+ +G QD + + HY N+PLYTHFT+P+RR+AD+IVHR L A L + D + D+L+ 
Sbjct ; 957 YYVAGK- QDTDSYAHYYFNLPLYTHPTAPLRRYADLIVHRQLKAVLN- KQVEDKDLDSLK 1014 

Query: 732 KQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESEAMVMGVLNQ AFDVLVLRF 787 

D+CN ++ + QE +1 L + + E + + MG + Q +FDV + F 

Sbjct: 1015 AITDYCNFKKDCAANAQEQAIHLLLSQTINEMSETAGQLLCMGTWQVYESSFDVFIPEF 1074 

Query: 78 8 GVQKRXYCNALALRSYSFQKVGKKPELTLVWEP- -DDLEEEPTQQVITIFSLVDWLQAE 84 5 

GV+KR++ + LL FK + L L WE D P + ++ + + 

Sbjct: 1075 GVEKRVHGDQLPL VKAEFDK - - NERI LELWWEKGVDSAT YI P PDEKS S LS YRNS I KNKYR 1132 

Query: 84 6 ATALKYSAI LKRPGLEKAS 864 

+AL+ +1 + LEK+ + 
Sbjct: 1133 TSALQAAKIQSKTALEKST 1151 

>gi [22748821 | ref |NP_689596 . 1 | hypothetical protein MGC42174 [Homo sapiens] 

gi| 20988352 |gb|AAH30113,l| similar to Hypothetical 93,7 kDa protein 
F48E8.6 in chromosome III 

[Homo sapiens] 
Length = 15 9 

Score = 242 bits (617), Expect = 2e-62 

Identities = 119/135 (88%), Positives * 127/135 (94%) 

Query: 727 PDTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKESGPLESE^MVMGVLNQAFDVLVLR 786 

PDTLQKQADHCND RMASKRVQELS LFFAVLVKESGPLESEAMVMG+L QAFDVLVLR 
Sbjct: 3 PDTLQKQADHCNDSRMASKRVQELSTSLFFAVLVKESGPLESEAMVMGILKQAFDVLVLR 62 

Query: 787 FGVQKRIYCNALiALRSYSFQKVGKKPELTLVWEPDDLEEEPTQQVITIFSLVDvVLQAEA 846 



+GVQKRIYCNALALRS + FQKVGKKPELTLVWEP+D+E+EP QQVI T I F S LV+ WLQAE 
Sbjct: 63 YGVQKRIYCNALALRSHHFQKVGKKPELTLVWEPEDMEQEPAQQVITIFSLVEWLQAEY 122 

Query: 84 7 TALKYSAI LKRPGLE 8 61 

TALKYSAI LKRPG + 
Sbjct; 123 TALKYSAI LKRPGTQ 13 7 

>gi (27685983 | ref [XP_237349 . 1 1 similar to hypothetical protein MGC37640 [Mu 
musculus] [Rattus 

norvegicus] 
Length = 1034 

Score = 241 bits (614) , Expect = 5e-62 

Identities = 117/128 (91%), Positives = 121/128 (94%), Gaps = 2/128 (1%) 

Query: 188 VKKLSISTPDRGKEDSSTPVMKDENTPIPQDTRGLSEKSLQKSAKVVYILEKKHSRAATG 247 

+KK+S S + DR KEDSS PVMKDENTP QDTR LSEKSLQKSAKWYILEKKHSRAATG 
Sbjct: 509 LKKIS-SSNDR-KEDSSAPVMKDENTPKSQDTRALSEKSLQKSAKWYILEKKHSRAATG 566 

Query: 248 ILKLLADBCNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWK 3 07 

ILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKD+ANTLFICRIIDWK 
Sbjct: 567 ILKLLADKNSDLFKKYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDYANTLFICRIIDWK 626 

Query: 3 08 EDCNFALG 315 

EDCNFALG 
Sbjct: 627 EDCNFALG 634 



Score = 142 bits (357) , Expect = 3e-32 
Identities = 64/80 (80%) , Positives = 70/80 (87%) 

Query: 122 KAVKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVIIEAQFDDSDSEDRHGNT 181 

+ VKPESNDKE EAT E DI EE CGHH LQ+S KGW+GPDVI IEAQFDDSDSEDRHGNT 
Sbjct: 328 RGVKPESNDKETEATNETDISEESCGHHLLQKSPKGWNGPDVT IEAQFDDSDSEDRHGNT 387 

Query: 182 SGLVDGVKKLS I STPDRGKE 2 01 

SGLVDG+KKLS+ TPD+G E 
Sbjct: 388 SGLVDGMKKLSVCTPDKGSE 407 



Score = 127 bits (320) , Expect = 6e-28 
Identities = 58/62 (93%) , Positives = 60/62 (96%) 

Query; 316 QLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKD 3 75 

QLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPP+EVGKRRDLR 
Sbjct: 724 QLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPEEVGKRRDLRSL 783 

Query: 376 CI 377 

Sbjct: 784 CL 785 



Score = 117 bits (292) , Expect = 9e-25 
Identities = 52/57 (91%), Positives = 57/57 (100%) 

Query: 68 LIQGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNGDLVWKLLPEDQWKAV 124 

+ + + GVLRINPKKFHEAF I PS PDGDRD I FIDG WARNRALNGDL VWKLL PEDQWK+ + 
Sbjct: 205 VLKGVLRINPKKFHEAFIPSPDGDRDIFIDGWARNRALNGDLVWKLLPEDQWKSL 261 



Score = 60.1 bits (144), Expect = 2e-07 
Identities = 25/28 (89%) , Positives = 28/28 (100%) 

Query: 373 RKDC I FT ID P S TARDLDDALACRRLTDG 400 

+KDCIFTIDPSTARDLDDAL+CR+LTDG 



SbjCt: 8 67 QKDCIFTIDPSTARDLDDALSCRQLTDG 8 94 



>gi | 6320499 | ref | NP_010579 . 1 | Product of gene unknown; Ssdlp [Saccharomyces 
cerevisiae] 



gi 
gi 
gi 



134917 
101639 
172612 
172697 



sp|P24276 |SSD1__YEAST SSD1 PROTEIN (SRK1 PROTEIN) 

pir| \R39578 SSD1 protein - yeast (Saccharomyces cerevisiae) 



gb 
gb 



AAA35047 . 1 
AAA35089.1 



SSD1 protein 
SRK1 

1230657 |gb|AAB64469.l| Ssdlp [Saccharomyces cerevisiae] 
Length =1250 



Score = 221 bits (564),. Expect = 3e-56 

Identities = 227/823 (27%), Positives = 369/823 (44%), Gaps = 95/823 (11%) 

Query; 4 9 I FET YMS KEDVS EGLKRGTL I QG VLR I NPKKFHEAF I PS PDG - - DRD I F IDG WARNRAL 106 

+F Y+ + + + E ++ G L+ G-f-LR+N K +A++ S DG D DI+I G RNRAL 
SbjCt : 340 LFAPYLPQANIPELIQEGRLVAGILRVNKKNRSDAW-STDGALDADIYICGSKTJRNRAL 398 

Query : 107 NGDLVWKLL-PEDQWKA-VKPESNDKEIEATYEADIPEEGCGHHPLQQSRKGWSGPDVI 164 

GDLV V+LL +D W+ + + E + +A+ + D+ PL S + V 

Sbjct: 399 EGDLVAVELLWDDVWESKKEKEEKKRRKDASMQHDLI PLNS SDDYHNDAS VT 451 

Query : 165 IEAQFDDSDSEDRHGNTSGLVDGVKKLSISTPDRGKEDSSTPV MKDENTPI 215 

+ S + S V++ ST + + S + P +K T 

SbjCt: 452 AATSNNFLSSPSSSDSLSKDDLSVRRKRSSTINNDSDSLSSPTKSGVRRRSSLKQRPTQK 511 

Query: 216 PQDTRGLSEKSL QKSAKWYILEKKHSRAATGILKLL ADKNSD 258 

D + +SL + W +L++ + +G L LL NSD 

Sbjct : 512 KNDDVEVEGQSLLLVEEEEINDKYKPLYAGHWAVLDRIPGQLFSGTLGLLRPSQQANSD 571 

Query: 259 LFK KYALFSPSDHRVPRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFA 313 

K K A F P+D +VP I +P + P+DF+ ++ LF+ I W 

Sbjct : 572 NNKPPQSPKIAWFKPTDKKVPLIAIPTELAPKDFVENADKYSEKLFVASIKRWPITSLHP 631 

Query: 314 LGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLEC LPQSLPWTIPPDEVG 3 67 

GLLG ++E+IL D+ S+E L+ P P + + + 

Sbjct : 632 FGILVSELGDIHDPDTEIDSILR DNNFLSNEYLDQKNPQKEKPSFQPLPLTAESLE 687 

Query : 368 KRR DLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKV 424 

RR D + IF I + AL R +GT E+G H+ DV+ + EGSS+D+ 

Sbjct : 688 YRRNFTDTNEYNI FAI S - ELGWVS EFALHVRNNGNGTLELGCHVVDVTSHI EEGS SVDRR 746 

Query: 425 AAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEG-KILEEWFGRTI 483 

A +R+++V++ QK+V +LP+ +EL SL P + T SV++ L +1 W G + 

Sbjct : 747 ARKRSSAVFMPQKLVNLLPQSFNDEL-SLAPGKESATLSWYTLDSSTLRIKSTWVGEST 805 

Query: 484 I R S CTKL S YDHAQS M I ENXXXXXXXXXXXXXXXXH S VE E VHQAVLNLH S I AKQLRRQRFV 543 

I LS + + S + + IA+ +R 

Sbjct: 806 ISPSNILSLEQLDEKL STGSPTSYLSTVQEIARSFYARRIN 846 

Query : 544 D-GALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQ 6 02 

D A L L X L +LD E + +1 + ++ E N VA KI+ + 

Sbjct : 84 7 DPEATLLPTLSLLESLDDEK-VKVDLNILDRTLGFWINEIKRKVNSTVAEKIYTKLGDL 905 

Query : 603 ALLRRHPPPQTPCMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNM 662 

ALLRR P ++ + G D ++A LK + K DD R + + 

Sbjct: 906 ALLRRQMQP I ATKMAS FRKKI QNFGYNFDTNTADELI KGVLKI KDDD VRVGIEILL 961 

Query : 663 YSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVI VHRLLAAALGYSEQ 722 

+ + M A YF +G + D +Q+ H YALN + P + YTHFT + P +RR + AD +VHR L A + + 
Sbjct : 962 F-KTMPRARYFIAGKV-DPDQYGHYALNLPIYTHFTAPMRRYADHVVHRQLKAVI - -HDT 1017 

Query: 723 PDVEP-DTLQKQADHCNDRRMASKRVQELSIGLFFAVLVKE SGPLESEAMVMGVLN 777 

P E + L+ +++CN ++ + + QE +1 L + + +G L + A V+ V 

Sbjct: 1018 PYTEDMEALKITSEYCNFKKDCAYQAQEQAIHLLLCKTINDMGNTTGQLLTMATVLQVYE 1077 



Query: 778 QAFDVLVLRFGVQKMYCNALALRSYSFQKVGKKPELTLVWEP 82 0 

+ FDV + FG++KR++ + L L F G L L W+P 

Sbjct: 1078 SSFDVFIPEFGIEKRVHGDQLPLIKAEFD--GTNRVLELHWQP 1118 

>gi | 15616115 | ref jNP_244420 . 1 | ribonuclease R; virulence-associated protein 
[Bacillus halodurans] 

gi | 25298858 |pir j | A84094 virulence-associated protein BH3553 [imported] - 
Bacillus 

halodurans (strain C-125) 
gi | 10176177 |dbj |BAB07272 . 1 | ribonuclease R [Bacillus halodurans] 
Length = 771 

Score = 214 bits (545) , Expect = 5e-54 

Identities = 163/515 (31%), Positives = 250/515 (48%), Gaps - 47/515 (9%) 

Query: 232 KWYILEKi^SRAATGILPCLLADKNSDLFKKyALFSPSDHRVPRIYVPLKDCPQDFMTRP 291 

KV+ ILE R+++ D + Y L D R+P + + Q + 

Sbjct: 127 KVIRILE RGVS EVTGTYVDHQT YGLWADDKRIPNDILIVNPEAQGLVDGH 177 

Query: 292 KDFANTLFICRIIDWKEDCNFALGQLAKS LGQAGEIEPETEGILTEYGVDFSDFSSEVLE 351 

K I +1 + E A G++ LG + + 1 + +YG+ F EVL+ 

Sbjct: 178 K VI VKITKYPEGRMSAEGEVIS ILGHKNDPGMDILS 1 1 YKYGI P - QAFPEEVLQ 230 

Query: 3 52 CL PQS LPWT I P PDEVGKRRDLRKDC I FT I DP S TARDLDDALACRRLTDGTFEVGVH I AD V 411 

Q+ + P I E+ RRDLR + + TID + A+DLDDA+ RL +G F++GVHIADV 
Sbjct : 231 - QAQN I PD E I DAGE I KNRRDLRDEPL VT I DGADAKDLDDAVHVS RL PNGNFKLGVH I AD V 289 

Query: 412 SYFVPEG3SLDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPE 471 

SY+V E S +D AAER TSVYLV +V+PM+P L +CSLNP D+LT S ++T + 
Sbjct: 290 S YYVKEKS P IDLEAAERGTS VYLVDR VI PMI PHRLSNGI CS LNPQVDRLTLS CEME I TRD 349 

Query: 4 72 GKI LEE WFGRT 1 1 RS CTKLS YDHAQSM I ENXXXXXXXXXXXXXXXXHS VE E VHQAVLNLH 531 

G+ + + +++IR+ +++Y ++ +EE 
Sbjct: 350 GQ VVNHE I FQS VI RTNERMTYHDVNKI LVEKEEELRNR YELL VPMFEDMEE 400 

Query: 532 S I AKQLRRQRFVDGALRLDQLKLAFTLDHETGL PQGCH I YE YRD SNKL VEE FMLLANMAV 591 

+A LR++RF GA+ DKA LEGP "IE + KL+EEFML AN V 
Sbjct: 401 -LAAILRKKRFGRGAIDFD-FKEAKVLVDEEGKPSDVVIRERGVAEKLIEEFMLAANETV 458 

Query: 592 AHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVS SAGALNKSLTKTFGDDKY 651 

A F +RHP++L +EF G + ++ ++L K ++ 

Sbjct: 459 AEH-FHWLKLPFMYRIHEDPDSEKLGRFLEFITNFGYWRGTANTVHPRALQKLL-EEVR 516 

Query: 652 SLARKEVLTNMYSRPMQiVIALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHR 711 

+ +V+ + + R MQ A Y D H+ L+ YTHFTSPIRR+ D+IVHR 

Sbjct: 517 GEPEEQVI STVMLRSMQQAKY DPTSLGHFGLSTEFYTHFTSPIRRYPDLIVHR 569 

Query: 712 LLAAALGYSEQPDVEPDTLQKQ ADHCND 73 9 

L+ Y + +V+ +T +K A HC++ 

Sbjct: 570 LIRE YLIKGNVDEETQEKWRERLPEIARHCSE 601 



>gi 



23099883 
22778114 



ref |NP_693349 . 1 | ribonulease R [Oceanobacillus iheyensis] 
dbj | BAC14384 . 1 | ribonulease R [Oceanobacillus iheyensis] 



Length = 744 



Score = 209 bits (533), Expect = le-52 

Identities = 140/418 (33%) , Positives = 214/418 (51%) , Gaps = 22/418 (5%) 

Query: 3 00 ICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPW 359 

I RI + E N A G++ LG + + 1 + ++G+ DF +VLE Q P 
Sbjct; 183 IARISKYPEGRNSAEGEIIHILGHKNDPGIDILSIIHKHGIQV-DFPEDVLE-QAQQTPE 240 

Query: 360 TIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGS 419 

I PDE+ RRDLR + I TID + A+DLDDA+ ++L++G +++GV+IADVS++V E S 



Sbjct: 241 EIHPDEINGRRDLRNETIVTIDGADAKDLDDAVTVKKLSNGNYKLGVYIADVSHYVKEES 300 



Query: 42 0 SLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWF 4 79 

+D+ A ER TSVYLV +V+PM+P L +CSLNP D+LT ++ +G+++E 

Sbjct: 301 PIDREALERGTSVYLVDRVIPMIPHRLSNGICSLNPKVDRLTLGCEMEINHQGQWEHEI 360 

Query: 480 GRTI IRSCTKIjSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHS IAKQLRR 539 

S + ++Y ++ + +E NL SI LR 
Sbjct: 361 FQS VI HS TERMT YSD VNKI L VDQDEELRDKYE PL VPMFEEME NLASI LRE 410 

Query: 54 0 QRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTF 599 

+RF GA+ D KAL + GP IE + +L+EEFML AN +A F 

Sbjct: 411 KRFGRGAIDFD-FKEAQVLVDDMGRPTDVAIRERSVAERLIEEFMLAANETIAEH-FHWM 468 

Query: 600 PEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVL 659 

+ R H P L +F +G+ + +AG ++ + D + + + 

Sbjct: 469 DVPFIHRIHEEPDEGKLEKFFDFLAGLGISVK-GTAGDIHPQELQKVIDSIRGEPEEMIV 527 

Query: 660 TNMYSRPMQMALYFCSGMLQDQEQFRHYALOTPLYTHFTSPIRRFADVIVHRLLAAAL 717 

+ +RMQAY D + H+L • YTHFTSPIRR+ D+IVHRL+ L 

Sbjct: 528 SKLMLRSMQQAKY DPQS IGHFGLATDFYTHFTS P I RRYPDL I VHRL I RTYL 578 

>gi ] 21673344 | ref | NP_661409 . 1 | ribonuclease II family protein [Chlorobium 
tepidum TLS] 

gi | 21646438 | gb | AAM71751 . 1 | ribonuclease II family protein .[Chlorobium 
tepidum TLS] 

Length = 720 

Score = 208 bits (529) , Expect = 4e-52 

Identities = 155/466 (33%), Positives = 233/466 (50%), Gaps = 57/466 (12%) 

Query: 265 LFSPSDHRV- PRIYVPLKDCPQDFMTRPKDFANTLFICRIIDWKEDCNFALGQLAKSLGQ 323 

LP + + P I++PLK + KAL + "+ + + LG 

Sbjct: 154 LLKPDQRKILPEIHIPLKAAKKAKAG-DKVLAGELEFLK SGTIQARVIEILGT 205 

Query: 324 AGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPS 383 

AGE + E I G+D F E+L + + I +++ +R D+R +FTIDP 

Sbjct: 206 AGESQVEVSAIARGLGID-ETFEPELL-TFAEKVREAITDEDLKERLDIRDKDVFTIDPV 263 

Query: 384 TARDLDDALACRRLTD-GTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKVVPML 442 

A+D DDAL+ L + G ++VGVHIADVS ++VPE S+LDK A +RATSVYLV +V+PML 
Sbjct: 264 DAKDFDDALSIETLGNGGGYKVGVHIADVSHYVPENSALDKEARKRATSVYLVDRVIPML 323 

Query: 443 PRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENX 502 

P L E++CSLNP D+L FSV + +T +G++ + F +T+I S + +Y+ Q +++ 
Sbjct: 324 PSRLSEKVCSLNPGVDRLAFSVFFNITKKGEVTKFEFHKTVIHSKRRFTYEDVQQILDAG 383 

Query: 503 XXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHET 562 

+ + + L ++K++R QR G L + ++ F L 
Sbjct: 384 KG DYFRELQALDQLSKKIRAQRMESGGLEFETEEVRFKLG- SN 425 

Query: 563 GLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTF PEQALLRRHPPPQTKML 616 

G P E DS++L+EEFMLLAN VA+ + P +RHPQ++ 

Sbjct: 426 GE P VE VI KKE RLD S HRL I E E FML LANRT VAA YLT ARYAENE KN PH P VI YRVHGAP QME KV 485 

Query: 617 S D L VE F CD QMGL PMD VS SAG ALNKSLTKTFGDDKYSLARKEVLTNMYSRPM 667 

L F ++G + + G. AL + LKG+ L+VL RM 
Sbjct: 486 QVLAS FVRKIGFDLKLDRKGKDSATVSSKALRELLQKVRGTNVEFLVNELVL RSM 540 

Query: 668 QMALYFCSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLL 713 

A+Y L D H+ L YTHFTSPIRR+ D+IVHR+L 

Sbjct: 541 SKAVY SPLNDG HFGLGFEHYTHFTSPIRRYPDLIVHRIL 579 

>gi | 15672924 | ref | NP_267098 . 1 | ribonuclease [Lactococcus lactis subsp. 
lactis] 



gi| 182 02798 |sp|Q9CH00|RNRl_LACLA Ribonuclease R 1 (RNase R 1) {VacB 
protein homolog 1) 

gi | 25298855 | pir | | F86742 ribonuclease [imported] - Lactococcus lactis 
subsp. lactis (strain 

IL1403) 

gi | 12 723 878 | gb | AAK0504 0 . 1 | AE0063 28_8 ribonuclease [Lactococcus lactis 
subsp. lactis] 

Length = 817 

Score = 206 bits (525) , Expect = 9e-52 

Identities = 137/421 (32%), Positives = 223/421 (52%), Gaps = 27/421 (6%) 

Query: 315 GQLAKSLGQAGEIEPETEGILTEYGTOFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRK 3 74 

G + + +GQA + . + +L + S + F EVL+ + + + P +P +E+ R D R 

Sbjct: 207 GL VTE I IGQADDQG IDVLE VLASMD I - VSE F PKE VLD - QAEAVPEE VPENE I VGRVD YRN 264 

Query: 375 DCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYL 434 

+ FTID + A+DLDDA+ +RL +G +E+GVHIADVS++V E S LDK A ER TSVY+ 
Sbjct: 265 EITFTIDGADAKDLDDAVHAKRLENGNYELGVHIADVSHYVTENSPLDKEAYERGTSVYV 324 

Query: 43 5 VQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDH 4 94 

+WPMLP L +CSLNP ++LT S + +++PEG+++ + + II + + + ++YD 

Sbjct: 325 TDRWPMLPERLSNG1CSLNPRINRLTQSCVMEISPEGRVINYQISQSIIKTTERMTYDA 384 

Query: 495 AQSMI ENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHS IAKQLRRQRFVDGALRLDQLKL 554 

MI + + + + + LH I + +R++R GA+ D ++ 
Sbjct: 385 VNQMIAGDEAALENYA KI AD S VKI MVELHH I LEAMRKRR GAIDFDTVE- 432 

Query: 555 AFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQTK 614 

A + +E GLP + +++E FML AN VA F + R H P+ 

Sbjct: 433 AKIIVNEKGLPIEIRKRTRGIAERMIESFMLEANETVATH-FEAHGLPFIYRIHEQPKAD 491 

Query: 615 MLSDL VE FCDQMGL PMDVS S AGALNKS L TKTFGDDKYS LARKE VL TNMYS R PMQMAL YFC 674 

L ++F G+ ++ +S G +++ + + F + VL+ M R MQ A Y 

Sb j ct : 4 92 RLQRFIDFAATFGMQIEGTSNG- I DQKVLQAFMKKI KGQPGEMVL STMLLRSMQQARY - - 54 8 

Query: 675 SGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVEPDTLQKQA 734 

+ H+ L YTHFTSPIRR+ D++VHRL+ +G + P + LQK 
Sbjct: 549 S ENNEGHFGLAAENYTHFTS P I RRYPDLLVHRLI - RE I GEGKTP ANILQKWE 599 

Query: 735 D 735 

Sbjct: 600 D 600 

>gi | 21754656 | dbj | BAC04542 . 1 | unnamed protein product [Homo sapiens] 
Length = 750 

Score = 205 bits (521) , Expect = 3e-51 

Identities = 133/383 (34%), Positives = 197/383 (51%), Gaps = 32/383 (8%) 

Query: 265 LFS PSDHRVPRI YVPLKDCP - - QDFMTRPKDFANTLF I CR I IDWKEDCNFALGQLAKSLG 322 

L +P D+R+P-fl + + QDF " + RI W+ + G + LG 

Sbjct: 366 LVTPWDYRIPKIRISTQQAETLQDFRV WRIDSWESTSVYPNGHFVRVLG 415 

Query: 323 QAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSL P WT I P PDE VGKRRDLRKD - CI F 378 

+ G++E E IL E + FS + +P + PW + P+E KR+DLRK +F 

Sbjct: 416 RIGDLEGE I AT I LVENS I S VI PFS EAQMCEMPVNTPES PWKVS PEEEQKRKDLRKSHLVF 475 

Query: 3 79 TIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKV 43 8 

+IDP D+DD L+ R L +G E+GVHIADV++FV S +D A RAT+ YL + 
Sbjct: 476 SIDPKGCEDVDDTLSVRTLNNGNLELGVHIADVTHFVAPNSYIDIEARTRATTYYLADRR 535 



Query: 43 9 VPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEG- KILEEWFGRTI IRSCTKLSYDHAQS 4 97 

MLP +L +LCSL D+ S++W+L +1 + W+GRTIIRS KL Y+ AQ 

Sbjct: 536 YDMLPS VLSADLCS LLGGVDRYAVS IMWELDKAS YE I KKVWYGRT 1 1 RSAYKLFYEAAQE 595 



Query: 498 MIENXXXXXXXXXXXXXXXXHS VEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLK 553 

+ + + S +EE+ A+ L IA+ +R +R GAL L+ + + 

Sbjct: 596 LLDGNLSVVDDIPEFKX>LGEKSRQAKLEELVWAIGKLTDIARHVRAKRDGCGALELEGVE 655 

Query: 554 LAFTLD HETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRH 608 

+ LD H+ Q ++E V E M+LAN VA KI + + FP QALLR+H 

Sbjct: 656 VCVQLDDKKNIHDLIPKQPLEVHE TVAECMIIANHWVAKKIWESFPHQALLRQH 709 

Query: 609 PPPQTKMLSDLVEFCDQMGLPMD 631 

PPP + S + L E G +D 

Sbjct: 710 PPPHQEFFSELRECAKAKGFFID 732 



Score = 59.3 bits (142), Expect = 3e-07 

Identities = 33/82 (40%), Positives = 51/82 (62%), Gaps = 8/82 (9%) 

Query: 50 FETYMSKEDVSEGLKRGTLIQGVLRINPKKFH - EAF I PSPDGD- -RDIFIDGWA 101 

+ ++ E + G+K G IQG+L +N + EAF+ S D D DI I G+ A 

Sbjct: 225 YP EHL PLE VLEAG I KS GR Y I QG I LNVNKHRAQ I E AF VRLQGAS S KD S DLVS DILI HGMKA 284 

Query: 102 RNRALNGDL VWKLL P ED Q WKA 123 

RNR+ ++GD+VW+LLP+ + +WK 
Sbjct: 285 RNRSIHGDWWELLPKNEWKG 306 

>gi | 23020108 |gb | ZP_00059814 . 1 ( hypothetical protein [Clostridium 
thermocellum ATCC 274 05.3 
Length =757 

Score = 203 bits (516) , Expect = le-50 

Identities = 153/510 (30%), Positives = 245/510 (48%), Gaps = 50/510 (9%) 

Query: 232 JCWYILEKK HSRAATGILKLLADKNSDLF KKYALFSPSDHRVP-RIYVPLK 281 

+V+ + KK RA I+K++ N + K + P D R+ I+ + 
Sbjct: 108 R VI AR INKKG IGDKRAEGE 1 1 KI VKRANKT WGTFES SKYFGF WPDD PR I SGD I F I 164 

Query: 282 DC PQDFMTRPKDFANTLFI CR 1 1 DWKEDCNFALGQLAKS LGQAGE I EPETEGI LTE YGVD 341 

P+D + K+ +I+WE A G++ + +G E + 1+ Y + 

Sbjct: 165 - -PKDEINGAK- - SGQKVVAEI VVWPEKRRNAEGRIIEIIGDKDEPGSDILSI IKAYNLR 220 

Query: 342 FSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGT 401 

DF EV+ +S+ T+ D + RRDLR + TID A+DLDDA++ RL +G 

Sbjct: 221 -EDFPEEVIR-EAKSISQTVTEDMIKGRRDLRDLTMVTIDGEDAKDLDDAVSIERLPNGN 278 

Query: 402 FEVGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLT 461 

+ +GVHIADVSY+ V EGS LDK A +R TSVYLV +V+PM P+ L +CSLNP D+L 
Sbjct: 279 YRLGVHIADVSYYVKEGSPLDKEALKRGTSVYLVDRVIPMFPKELSNGICSLNPKVDRLA 338 

Query: 462 FSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVE 521 

F+V+ ++ G++++ ++I +++Y ++E H+++ 

Sbjct: 339 FTVMME I DKSGR WDHE I FES VINVNERMTYTD VYKI LEENDEGL I ER YKYLCDTFHTMK 398 

Query: 522 EVHQAVLNLHS IAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVE 581 

E +A LR++R GA+ + + LD E G+P YE +N + + E 

Sbjct: 399 E IALILRKKRMDRGAIDFNFDEAKIVLD-EKGVPIEVKRYEMTIANNIIE 447 

Query: 582 EFMLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKS 641 

EFML+ NVA FT +RHP + EF +G+ +NK 
Sbjct: 448 EFMLVCNETVAEHFFWT-NTPFVYRIHEDPDPDKIEAFSEFVHNLGYTLK GINKI 501 

Query: 642 LTKTFGD- -DKYSLARKE- VLTNMYSRPMQMALY- -FCSGMLQDQEQFRHYALNVPLYTH 696 

K D +K .++E + + + + R +Q A Y SG H+ L Y H 

Sbjct: 502 HPKALQD VLE KARGTKEET IIS TVMLRS LQKAR YS H INSG HFGLAAKYYCH 552 



Query: 697 FT S P I RRFAD VI VHRLLAAALG YS E QPDVE 726 



FTSPIRR+ D+I+HR++ L + P+ E 
Sbjct: 553 FTSPIRRYPDLIIHRIMKEYLKGTVNPERE 582 



>gi | 16801605 | ref |NP_471873 . 1 | similar to exoribonuclease RNase-R [Listeria 
innocua] 

gi | 25298866 |pir | | AB1750 exoribonuclease RNase-R homolog lin2543 [imported] 
- Listeria 

innocua (strain Clipll262) 
gi | 16415065 j emb | CAC97770 . 1 | similar to exoribonuclease RNase-R [Listeria 
innocua] 

Length =7 93 
Score = 201 bits (512), Expect - 3e~50 

Identities = 138/430 (32%), Positives = 219/430 (50%), Gaps = 39/430 (9%) 

Query: 313 ALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDL 372 

AG+ +G + + 1+ ++G+ + F EV+E + + + P + ++G RRDL 
Sbjct: 195 ARG WKS I IGHRNDPGVD I LS 1 1 HKHG I S IA - FPEEVMEQ VS KA - PD WDDS D I GNRRDL 252 

Query: 373 RKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSV 432 

R I TID + A+DLDDA+ ++L +G +++GVHIADV+++V EGS LDK A +R TSV 
Sbjct: 253 RDQMIITIDGADAKDLDDAVTVKQLPNGNWKLGVHIADVTHYVTEGSPLDKEAQDRGTSV 312 

Query: 433 YLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSY 4 92 

YLV +V+PMLP L +CSLNP D+ T S ++ EG + + ' + II + + + ++Y 
Sbjct: 313 YLVDRVI PMLPHKLSNGI CS LNPQVDRFTMS CEME I DQEGHWNHE I FES 1 1 KTTERMT Y 372 

Query: 4 93 DHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQL 552 

+ + + + + +A+ LRR+R GA+ D 

Sbjct: 373 TDVNDILVEKDEALREKYAPIVPMLEAMQH LAE I LRRKRE KRGAIDFD - F 421 

Query: 553 KLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQ 612 

KA + E G P+ + E L+EEFML AN VA F + R H P+ 

Sbjct: 422 KEARVWDEEGHPEEWMRERSAGEHLIEEFMLAANETVAEH-FHWMDVPFIYRIHEDPK 480 

Query: 613 TKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALY 672 

L+ EF GL + K +D + A ++VL + +P +M + 

Sbjct: 481 EDKLARFFEFITNFGL I VKGT AND I H P AAL QQ VL E E VKGKP E E MW - 526 

Query: 673 FCSGMLQDQEQFR HYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDV 725 

+ ML+ +Q + H+ L+ YTHFTSPIRR+ D+IVHRL+ Y DV 

Sbjct: 527 - S TVMLRSMQQAKYDT VSAGHFGLS TDF YTH FTS P I RRY PDL I VHRL I RE YLINGDV 582 

Query: 726 EPDTLQKQAD 735 

P+TL+K+A+ 
Sbjct: 583 RPETLEKRAE 592 

>gi | 16804487 | ref |NP_465972 . 1 | similar to exoribonuclease RNase-R [Listeria 
monocytogenes EGD-e] 

gi | 25298867 |pir | | AI1380 exoribonuclease RNase-R homolog lmo2449 [imported] 
- Listeria 

monocytogenes (strain EGD-e) 
gi | 16411937 | emb | CAD00527 . 1 | similar to exoribonuclease RNase-R [Listeria 
monocytogene s ] 

Length = 793 

Score = 200 bits (509) , Expect - 7e-50 

Identities = 138/430 (32%), Positives = 218/430 (50%), Gaps = 39/430 (9%) 

Query: 313 ALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDL 3 72 

A G + +G + + 1+ ++G+ F EV+E + ++ P + ++G RRDL 
Sbjct: 195 ARGWKS I IGHRNDPGVDILS I IHKHGIS IG- FPEEVMEQVSKA- PDLVDDSD I GNRRDL 252 

Query: 3 73 RKDCIFTIDPSTARDLDDALACRRLTDGTFEVGWIADVSYFVPEGSSLDKVAAERATSV 43 2 

R I TID + A+DLDDA+ ++L +G +++GVHIADV+++V EGS LD A ER TSV 



• 



Sbjct: 253 RDQM 1 1 T I DGADAKDLDDAVTVKQL PNGNWKLGVH I AD VTH YVTEGS PLD I EAQERGT S V 312 

Query: 433 YLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSY 492 

YLV +V+PMLP L +CSLNP D+ T S . ++ EG ++ +II++ + ++Y 

Sbjct: 313 YLVDRVI PMLPHKLSNGICSLNPQVDRFTMSCEMEIDQEGHWNHEI FES I IKTTERMTY 372 

Query: 493 DHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQL 552 

++ + + + ++A+ LRR+R GA+ D 

Sbjct: 373 TDVNDILVEKDEALRBKYAPIVPMLEAMQ NLAEILRRKREKRGAIDFD- F 421 

Query: 553 KLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTFPEQALLRRHPPPQ 612 

K A + EGP+ + E L+EEFML AN VA F + R H P+ 

Sbjct; 422 KEAR WVDEDGH PEA WMRER S AGEHL I E E FMLAANE T VAEH - FHWMD VPF I YR I HED PK 480 

Query: 613 TKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVLTNMYSRPMQMALY 672 

L+ EF GL + K +D + A ++VL + +P +M + 

Sbjct: 481 EDKLARFFEFITNFGL IVKGTANDIHPAALQQVLEEVKGKPEEMW- 526 

Query: 673 FCSGMLQDQEQFR HYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDV 72 5 

+ ML+ +Q + H+ L+ YTHFTSPIRR+ D+IVHRL+ Y DV 

Sbjct: 527 -STVMLRSMQQAKYDTVSAGHFGLSTDFYTHFTSPIRRYPDLIVHRLIRE YLINGDV 582 

Query: 726 EPDTLQKQAD 735 

P+TL+K+A+ 
Sbjct: 583 RPETLEKRAE 592 

>gi | 21397569 | ref |NP_653554 . 1 | RNB , RNB-like protein [Bacillus anthracis 
A2 012] 

Length = 808 
Score = 197 bits (501), Expect = 7e-49 

Identities = 133/414 (32%), Positives = 214/414 (51%), Gaps = 22/414 (5%) 

Query: 300 ICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPW 359 

+ +1 + E+ A G++ + LG + + ++++++ F EV+E S + P 
Sbjct: 185 WKITSYPENRLSAEGEVIQILGHKKDPGVDILSVIHKHHLPLA-FPEEVME-HANSVPE 242 

Query: 3 60 TIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGS 419 

TI +++ RRDLR ' I TID + A+DLDDA+ +L +G +++GVHIADVS++V EGS 
Sbjct: 243 TISEEDLKDRRDLRDQMIVTIDGADAKDLDDAVTVTKLENGNYKLGVHIADVSHYVQEGS 3 02 

Query: 420 SLDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWF 4 79 

+D AAERATSVYLV +V+PM+P L +CSLNP D+LT S + + G'+h 
Sbjct: 303 PIDVEAAERATSVYLVDRVIPMIPHRLSNGICSLNPKVDRLTLSCEMEINNLGDWKHEI 362 

Query: 480 GRTIIRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRR 539 

+++I++ + ++Y +S++E+ E + + +A+ LR 

Sbjct: 363 FQSVIKTTERMTYADVRSILEDEDEELMKRY EPLVPMFKEMGQLAQILRE 412 

Query: 540 QRFVDGALRLDQLKLAFTLDHETGLPQGCH I YEYRDSNKIjVEEFMLLANMAVAHKI FRTF 599 

+R GA+ DKALEGP ++ S KL+EEFML+AN VA F 

Sbjct: 413 KRMRRGAIDFD-FKEAKVLVDEEGKPTDVVMRDRSVSEKLIEEFMLVANETVAEH-FHWM 470 

Query: 6 00 PEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVL 6 59 

+ RHP+ L EF G + + ++L + + + V+ 

Sbjct: 471 NVPFMYRVHEDPKEDKiERFFEFVTNFGYAVKGRANEVHPRALQQIL-EMVQGQPEEVVI 529 

Query: 6 60 TNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTS P I RRFADVI VHRLL 713 

+ + R M+ A Y D + H+ L+ YTHFTSPIRR+ D IVHRL+ 

Sbjct: 530 STVMLRSMKQARY DADSLGHFGLSTEFYTHFTSPIRRYPDTIVHRLI 576 

>gi | 16080414 | ref |NP_391241 . 1 [ similar to hypothetical proteins [Bacillus 
subtilis] 

gi|7674332|sp|03223l|RMR_BACSU Ribonuclease R (RNase R) (VacB protein 
homo log) 



gi | 7443068 |pir| |G70027 conserved hypothetical protein yvaJ - Bacillus 
subtil is 

gi | 2635874 |emb|CAB15366,l| similar to hypothetical proteins [Bacillus 
subtilis] 

Length = 779 
Score =* 195 bits (495) , Expect = 3e-48 

Identities = 131/418 (31%), Positives = 213/418 (50%), Gaps = 22/418 (5%) 

Query: 300 ICRIIDWKEDCNFALGQhAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPW 359 

+ ++ + E. A G++ LG + + + + ++G+ . + F ++ +E S P 
Sbjct: 181 WKLTSYPEGRMNAEGEVETILGHKNDPGIDILSVIHKHGLP-GEFPADAME-QASSTPD 238 

Query: 360 TIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVH1ADVSYFVPEGS 419 

TI ++ RRDLR I TID + A+DLDDA+ +h DG++++GVHIADVS++V E S 
Sbjct: 23 9 TIDEKDLKIDRRDLRDQVIVTIDGADAKDLDDAVTVTKLDDGSYKLGVHIADVSHYVTENS 298 

Query: 420 SLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWF 479 

+DK A ER TSVYLV +V+PM+P L +CSLNP D+LT S + +G+ + E 

Sbjct: 299 PIDKEALERGTSVYLVDRVIPMIPHRLSNGICSLNPKVDRLTLSCEMTINSQGQVTEHEI 358 

Query: 480 GRTI IRSCTKLSYDHAQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHS IAKQLRR 53 9 

+++Y + + + 1 E + ++ +A+ LR 

Sbjct: 359 FQSVIKTTERMTYSDVNKILVD DDEELKQKYEPLVPMFKDMERLAQILRD 408 

Query: 540 QRFVDGALRLDQLKIiAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVAHKIFRTF 599 

+R QA+ DKAL+G + IE + KL+EEFML+AN VA F 
Sbjct: 409 KRMDRGAVDFD-FKEAKVLVCDEGAVKDWIRERSVAEKLIEEFMLVANETVAEH-FHWM 466 

Query: 600 PEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSLARKEVL 659 

+ R H P + L +EF G + +AG ++ - + + D + V+ 

Sbjct: 467 NVPFI YRIHEEPNAEKLQKFLEFVTTFGYWK- GTAGNIHPRALQS I LDAVRDRPEETVI 525 

Query: 660 TNMYSRPMQI^LYFCSGMLQDQEQFRHYALlWPLYTHFTSPIRRFADVIVHRLLtAAAL 717 

+ + r M+ A Y D + H+ L+ YTHFTSPIRR+ D+IVHRL+ L 

Sbjct: 526 S T VML RS MKQAKY DPQSLGHFGLSTEFYTHFTSPIRRYPDLI VHRLIRTYL 576 

>gi 1 158940-03 | ref |NP_347352 .1 | FUSION ribonuclease and ribosomal protein SI 
domain [Clostridium 

acetobutylicum] 

gi|2529886l|pir| |A96988 FUSION ribonuclease and ribosomal protein SI 

domain [imported] - 

Clostridium acetobutylicum 
gi|l5023595|gb|AAK78692.l|AE007586_9 FUSION ribonuclease and ribosomal 
protein SI domain [Clostridium 

ace tobutyl i cum] 
Length =73 0 

Score = 190 bits (483) , Expect = 7e-47 

Identities - 128/420 (30%), Positives - 213/420 (50%), Gaps = 35/420 (8%) 

Query: 300 ICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPW 359 

+ I W + G++ + +G GE + 1+ ++ + E +E + +P 

Sbjct: 177 VAEIT VWPKKRRNPEGKI VE I IGSKGEKGVDILTT I KKH- - KLPEKFPEKVERFAEGIPN 234 

Query: 3 60 TIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGS 419 

IP E +RRD+R + TID A+DLDDA++ +L++G F++GVHIADVS +V E + 
Sbjct: 235 E I PE S EYKRRRD I RD VKMVT I DGEDAKDLDDAVS I EKLSNGNFKLG VH I AD VSNYVREDN 294 

Query: 42 0 SLDKVAAERATSVYLVQKWPMLPRLLCEELCSLNPMTDKLTFSVIWKLTPEGKILEEWF 479 

LDK A +RATSVYL+ +V+PMLP+ L +CSLNP D+L S + + P GK+++ 
Sbjct: 295 PLDKSlALKlUyrSVYLIDRVIPMLPKKLSNGICSL 354 

Query: 480 GRTI I RSCTKLSYDHAQSM I ENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHS IAKQLRR 539 

+ H + + + ++Y + + + + E ++ N+ +A L + 
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Sbjct: 355 FE S 1 1 KTNERMTYTDVTKI LRD HDEETIKAFEYLYDDFKNMEELASILNK 404 

Query: 540 QRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEFMLLANMAVA-HKIFRT 598 

+R + GA+ D + TL+ E G P YE +N+++EEFML+ N +A H + 

Sbjct: 405 KRLLRGAIDFDFEESKITLN-ELGKPVEVKPYERAVANRIIEEFMLVCNETIAEHFYWAN 463 

Query: 599 FPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSS AGALNKSLTKTFGDDKYSLAR 655 

P +RHP++L EF +G+ S L +KG + 
Sbjct: 464 IP - -FVYRVHEEPDSEKLERFNEFIHNLGYAVRWGSEVHPKQLQDVIEKIKGKKE 516 

Query: 656 KEVLTNMYSRPMQMALYF- -CSGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLL. 713 

+ V++ + R ++ A Y CSG H+ L Y HFTSPIRR+ D+I+HR++ 

Sbjct: 517 ETWSTLLLRSLKQARYSPECSG HFGLAARYYCHFTSPIRRYPDLI IHRIM 567 

>gi | 27467483 | ref |NP_764120 . 1 | ribonuclease R [Staphylococcus epidermiclis 
ATCC 12228] 

gi | 27315026 |gb | AAO04162 . 1 | AEO 16745_2 61 ribonuclease R [Staphylococcus 
epidermidis ATCC 12228] 
Length = 792 

Score = 190 bits (483) , Expect = 8e-47 

Identities = 132/430 (30%), Positives - 215/430 (50%), Gaps = 26/430 (6%) 

Query: 315 GQLAKSLGQAGEIEPETEGILTEYGVDFSDFSSEVLECLPQSLPWTIPPDEVGKRRDLRK 3 74 

G ++ LG + + 1+ ++G++ +F +VL+ + +P I P E+ RRDLR 
Sbjct: 199 GHVSAILGHKNDPGVD I LS 1 1 YQHGIE I - EFPDDVLQ - EAEEVPDVIEPSE I EGRRDLRD 256 

Query: 3 75 DCIFTIDPSTARDLDDALACRRLTDGTFEVGVHIADVSYFVPEGSSLDKVAAERATSVYL 434 

+ TID + A+DLDDA+A ++L +G E+ V IADVSY+V EGS+LDK A +RATSVYL 
Sbjct: 257 ELTITIDGADAKDLDDAIAVKKLKNGNTELTVSIADVSYYVKEGSALDKEAYDRATSVYL 316 

Query: 435 VQKWPMLPRLLCEELCSLNPMTDECLTFSVIWKLTPEGKILEEWFGRTIIRSCTKLSYDH 494 

V +V+PM+P L +CSLNP D+LT S ++ G+ + + + ++I S +++YD 

Sbjct: 317 VDRVIPMIPHRLSNGICSLNPEEDRLTLSCRMEINERGEWKHEIFDSVIHSNYRMTYDA 376 

Query: 495 AQSMIENXXXXXXXXXXXXXXXXHSVEEVHQAVLNLHSIAKQLRRQRFVDGALRLDQLKL 554 

+1 + ++ ++ +L R R G + D + 

Sbjct: 377 VNKI I TDQDS E I RSQYKDLTPMLDLAQD LSNRLIRMRKRRGEIDFD- INE 425 

Query: 555 AFTLDHETGLPQGCHIYEYRDSNKliVEEFMLLANMAVAHKIFRTFPEQAL'LRRHPPPQTK 614 

A L ++ G+P + E + +L+E FML AN VA F + R H P+ + 

Sbjct: 426 AKVL VNDE G I P TE VLMRERGEGERL I E S FMLAANE T VAEH - FNKLE VPF I YR VHE QPKS D 484 

Query: 615 MLSDLVEFCDQMGLPMDVSSAGALNKSLTKTFGDDKYSIiARKEVLTMMYSRPMQMAIjYFC 674 

L +F G+ M + ++ + + ++ + V++ M R MQ A Y 

Sbjct: 485 RLRQFFDFITNFGI -MIKGTGEDIHPTTLQNIQEEVEGRPEQMVISTMMLRSMQQAHY- - 541 

Query: 675 SGMLQDQEQFRHYALNVPLYTHFTSPIRRFADVIVHRLLAAALGYSEQPDVE PDTL 730 

D H+ L+ YTHFTSPIRR+ D+ VHRL+ L + E +TL 

Sbjct: 542 DDWLGHFGLSAEYYTHFTSPIRRYPDLTVHRLIRKYLIENSMDKKEIRHWEETL 596 

Query: 731 QKQADHCNDR 740 

+ A+H + R 
Sbjct: 597 PELAEHTSQR 606 

Database: All non-redundant GenBank CDS 
trans lations+PDB+SwissProt+PIR+PRF 

Posted date: Jan 29, 2003 9:26 AM 
Number of letters in database: 423,161,544 
Number of sequences in database.- 1,321,324 

Lambda K H 

0.319 0.136 0.399 



Gapped 



r Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 681,347,984 

Number of Sequences: 1321324 

Number of extensions: 28649337 

Number of successful extensions: 62317 

Number of sequences better than 10.0: 242 

Number of HSP's better than 10.0 without gapping: 232 

Number of HSP's successfully gapped in prelim test: 10 

Number of HSP's that attempted gapping in prelim test: 

Number of HSP's gapped (non-prelim): 402 

length of query: 87 0 

length of database: 423,161,544 

effective HSP length: 132 

effective length of query: 738 

effective length of database: 248,746,776 

effective search space: 183575120688 

effective search space used: 183575120688 
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